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Abstract. 

Imaging the Universe during the first hundreds of millions of years remains 
one of the exciting challenges facing modern cosmology. Observations of the 
redshifted 21 cm line of atomic hydrogen offer the potential of opening a new 
window into this epoch. This will transform our understanding of the formation 
of the first stars and galaxies and of the thermal history of the Universe. A new 
generation of radio telescopes is being constructed for this purpose with the first 
results starting to trickle in. In this review, we detail the physics that governs the 
21 cm signal and describe what might be learnt from upcoming observations. We 
also generalize our discussion to intensity mapping of other atomic and molecular 
lines. 
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1. Introduction 

Our understanding of cosmology has matured significantly over the last twenty years. 
In that time, observations of the Universe from its infancy, 400,000 years after the 
Big Bang, through to the present day, some 13.7 billion years later, have given us 
a basic picture of how the Universe came to be the way it is today. Despite this 
progress much of the first billion years of the Universe, a period when the first stars 
and galaxies formed, is still an unobserved mystery. 

Astronomers have an advantage over archaeologists in that the finite speed of 
light gives them a way of looking into the past. The further away an object is located 
the longer the light that it emits takes to reach an observer today. The image recorded 
at a telescope is therefore a picture of the object long ago when the light was first 
emitted. The construction of telescopes both on the Earth, such as Keck, Subaru and 
VLT, and in space, such as the Hubble Space Telescope, has enabled astronomers to 
directly observe galaxies out to distances corresponding to a time when the Universe 
was a billion years old. 

Added to this, observations at microwave frequencies reveal the cooling afterglow 
of the big bang. This cosmic microwave background (CMB) decoupled from the cosmic 
gas 400,000 years after the Big Bang when the Universe cooled sufficiently for protons 
and electrons to combine to form neutral hydrogen. Radiation from this time is able 
to reach us directly, providing a snapshot of the primordial Universe. 

Despite current progress, connecting these two periods represents a considerable 
challenge. Our understanding of structure is based upon the observation of small 
perturbations in the temperature maps of the CMB. These indicate that the early 
Universe was inhomogeneous at the level of 1 part in 100,000. Over time the action of 
gravity causes the growth of these small perturbations into larger non-linear structures, 
which collapse to form sheets, filaments, and halos. These non-linear structures 
provide the framework within which galaxies form via the collapse and cooling of 
gas until the density required for star formation is reached. 

The theoretical picture is well established, but the middle phase is largely untested 
by observations. To improve on this astronomers are pursuing two main avenues of 
attack. The first is to extend existing techniques by building larger, more sensitive, 
telescopes at a variety of wavelengths. On the ground, there are plans for optical 
telescopes with an aperture diameter of 24-39 meters - the Giant Magellan Telescope 
(GMT), the Thirty Meter Telescope (TMT), and the European Extremely Large 
Telescope (E-ELT) - that would be able to detect an individual galaxy out to redshifts 
z > 10. In space, the James Webb Space Telescope (JWST) will operate at infrared 
wavelengths and potentially image some of the first galaxies at z <~ 10-15. Other 
efforts involve the Atacama Large Millimeter / submillimeter Array (ALMA), which 
will observe the molecular gas that fuels star formation in galaxies during reionization 
(z = 8 — 10). These efforts target individual galaxies although the objects of interest 
are far enough away that only the brightest sources may be seen. 

This review focuses on an alternative approach based upon making observations 
of the red-shifted 21 cm line of neutral hydrogen. This 21 cm line is produced by the 
hyperfine splitting caused by the interaction between electron and proton magnetic 
moments. Hydrogen is ubiquitous in the Universe, amounting to ^75% of the gas 
mass present in the intergalactic medium (IGM). As such, it provides a convenient 
tracer of the properties of that gas and of major milestones in the first billion years 
of the Universe's history. 
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The 21 cm line from gas during the first billion years after the Big Bang redshifts 
to radio frequencies 30-200 MHz making it a prime target for a new generation of radio 
interferometers currently being built. These instruments, such as Murchison Widefield 
Array (MWA), the LOw Frequency ARray (LOFAR), the Precision Array to Probe the 
Epoch of Reionization (PAPER), the 21 cm Array (21CMA), and the Giant Meter- 
wave Radio Telescope (GMRT), seek to detect the radio fluctuations in the redshifted 
21 cm background arising from variations in the amount of neutral hydrogen. Next 
generation instruments (e.g. SKA) will be able to go further and might make detailed 
maps of the ionized regions during reionization and measure properties of hydrogen 
out to z = 30. These observations constrain the properties of the intergalactic medium 
and by extension the cumulative impact of light from all galaxies, not just the brightest 
ones. In combination with direct observations of the sources they provide a powerful 
tool for learning about the first stars and galaxies. They will also provide information 
about active galactic nuclei (AGN), such as quasars, by observing the ionized bubbles 
surrounding individual AGN. 

In addition to learning about galaxies and reionization, 21 cm observations have 
the potential to inform us about fundamental physics too. Part of the signal traces the 
density field giving information about neutrino masses and the initial conditions from 
the early epoch of cosmic inflation in the form of the power spectrum. However spin 
temperature fluctuations driven by astrophysics also contribute to the signal. Getting 
at this cosmology is a challenge, since the astrophysical effects must be understood 
before cosmology can be disentangled. One possibility is to exploit the effect of 
redshift space distortions, which also produce 21 cm fluctuations but directly trace the 
density field. In the long term, 21 cm cosmology may allow precision measurements of 
cosmological parameters by opening up large volumes of the Universe to observation. 

The goal of this review is to summarise the physics that determines the 21 cm 
signal, along with a comprehensive overview of related astrophysics. Figure [I] provides 
a summary of the 21 cm signal showing the key features of the signal with the relevant 
cosmic time, frequency, and redshift scales indicated. The earliest period of the signal 
arises in the period after thermal decoupling of the ordinary matter (baryons) from the 
CMB, so that the gas is able to cool adiabatically with the expansion of the Universe. 
In these cosmic "Dark Ages", before the first stars have formed, the first structures 
begin to grow from the seed inhomogeneties thought to be produced by quantum 
fluctuations during inflation. The cold gas can be seen in a 21 cm absorption signal, 
which has both a mean value (shown in the bottom panel) and fluctuations arising 
from variation in density (shown in the top panel). Once the first stars and galaxies 
form, their light radically alters the properties of the gas. Scattering of Lya photons 
leads to a strong coupling between the excitation of the 21 cm line spin states and 
the gas temperature. Initially, this leads to a strong absorption signal that is spatially 
varying due to the strong clustering of the rare first generation of galaxies. Next, the 
X-ray emission from these galaxies heats the gas leading to a 21 cm emission signal. 
Finally, ultraviolet photons ionize the gas producing dark holes in the 21 cm signal 
within regions of ionized bubbles surrounding groups of galaxies. Eventually all of the 
hydrogen gas, except for that in a few dense pockets, is ionized. 

Throughout this review, we will make reference to parameters describing the 
standard ACDM cosmology. These describe the mass densities in non-relativistic 
matter Vt m = 0.26, dark energy Da = 0.74, and baryons fib = 0.044 as a fraction 
of the critical mass density. We further parametrise the Hubble parameter Ho = 
100/ikms^ 1 Mpc -1 with h — 0.74. Finally, the spectrum of fluctuations is described 



CONTENTS 



5 




Figure 1. The 21-centimeter cosmic hydrogen signal, (a) Time evolution of 
fluctuations in the 21-cm brightness from just before the first stars formed through 
to the end of the reionization epoch. This evolution is pieced together from 
rcdshift slices through a simulated cosmic volume [l]. Coloration indicates the 
strength of the 21-cm brightness as it evolves through two absorption phases 
(purple and blue), separated by a period (black) where the excitation temperature 
of the 21-cm hydrogen transition decouples from the temperature of the hydrogen 
gas, before it transitions to emission (red) and finally disappears (black) owing to 
the ionization of the hydrogen gas. (b) Expected evolution of the sky-averaged 
21-cm brightness from the "dark ages" at redshift 200 to the end of reionization, 
sometime before redshift 6 (solid curve indicates the signal; dashed curve indicates 
7), = 0). The frequency structure within this redshift range is driven by several 
physical processes, including the formation of the first galaxies and the heating 
and ionization of the hydrogen gas. There is considerable uncertainty in the exact 
form of this signal, arising from the unknown properties of the first galaxies. 



by a logarithmic slope or "tilt" ns = 0.95, and the variance of matter fluctuations 
today smoothed on a scale of 8h~ x Mpc is as = 0.8. The values quoted are indicative 
of those found by the latest measurements [5] . 

The layout of this review is as follows. We first discuss the basic atomic physics 
of the 21 cm line in f|2| In Sj3j we turn to the evolution of the sky averaged 21 cm 
signal and the feasibility of observing it. In [j4]wc describe three-dimensional 21 cm 
fluctuations, including predictions from analytical and numerical calculations. After 
reionization, most of the 21 cm signal originates from cold gas in galaxies (which 
is self-shielded from the background of ionizing radiation) . In Sj5] we describe the 
prospects for intensity mapping of this signal as well as using the same technique 
to map the cumulative emission of other atomic and molecular lines from galaxies 
without resolving the galaxies individually. The 21 cm forest that is expected against 
radio bright sources is described in ^6] Finally, we conclude with an outlook for the 
future in SjT] 

We direct interested readers to a number of other worthy reviews on the subject. 
Ref. [5] provides a comprehensive overview of the entire field, and Ref. [I] takes a 
more observationally orientated approach focussing on the near term observations of 
reionization. 
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2. Physics of the 21 cm line of atomic hydrogen 

2.1. Basic 21 cm physics 

As the most common atomic species present in the Universe, hydrogen is a useful tracer 
of local properties of the gas. The simplicity of its structure - a proton and electron 
- belies the richness of the associated physics. In this review, we will be focusing on 
the 21 cm line of hydrogen, which arises from the hyperfine splitting of the IS* ground 
state due to the interaction of the magnetic moments of the proton and the electron. 
This splitting leads to two distinct energy levels separated by AE = 5.9 x 10 _6 eV, 
corresponding to a wavelength of 21.1 cm and a frequency of 1420 MHz. This frequency 
is one of the most precisely known quantities in astrophysics having been measured 
to great accuracy from studies of hydrogen masers [5] . 

The 21 cm line was theoretically predicted by van de Hulst in 1942 [6] and has 
been used as a probe of astrophysics since it was first detected by Ewen & Purcell in 
1951 7J. Radio telescopes look for emission by warm hydrogen gas within galaxies. 
Since the line is narrow with a well measured rest frame frequency it can be used 
in the local Universe as a probe of the velocity distribution of gas within our galaxy 
and other nearby galaxies. 21 cm rotation curves are often used to trace galactic 
dynamics. Traditional techniques for observing 21 cm emission have only detected the 
line in relatively local galaxies, although the 21 cm line has been seen in absorption 
against radio loud background sources from individual systems at redshifts z < 3 [HUH]. 
A new generation of radio telescopes offers the exciting prospect of using the 21 cm 
line as a probe of cosmology. 

In passing, we note that other atomic species show hyperfine transitions that 
may be useful in probing cosmology. Of particular interest are the 8.7 GHz hyperfine 
transition of 3 He + [TUHH], which could provide a probe of Helium reionization, and 
the 92 cm deuterium analogue of the 21 cm line |12j . The much lower abundance 
of deuterium and 3 He compared to neutral hydrogen makes it more difficult to take 
advantage of these transitions. 

In cosmological contexts the 21 cm line has been used as a probe of gas along the 
line of sight to some background radio source. The detailed signal depends upon the 
radiative transfer through gas along the line of sight. We recall the basic equation of 
radiative transfer for the specific intensity I v (per unit frequency v) in the absence of 
scattering along a path described by coordinate s [13] 

= -a v I v + j v (1) 

ds 

where absorption and emission by gas along the path are described by the coefficients 
a v and j„, respectively. 

To simplify the discussion, we will work in the Ray leigh- Jeans limit, appropriate 
here since the relevant photon frequencies v are much smaller than the peak frequency 
of the CMB blackbody. This allows us to relate the intensity I v to a brightness 
temperature T by the relation I v — 2ksTv 2 / c 2 , where c is the speed of light and ks is 
Boltzmann's constant. We will also make use of the standard definition of the optical 
depth r = f ds a„(s). With this we may rewrite |lj) to give the radiative transfer for 
light from a background radio source of brightness temperature Tr along the line of 
sight through a cloud of optical depth r„ and uniform excitation temperature T ex so 
that the observed temperature Tg bs at a frequency v is given by 

T b obs =T ex (l- e- T ") + T R (z/)e- T ". (2) 
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The excitation temperature of the 21 cm line is known as the spin temperature 
T s . It is defined through the ratio between the number densities of hydrogen atoms 
in the two hyperfine levels (which we label with a subscript and 1 for the IS" singlet 
and IS triplet levels, respectively) 

n>i/n = (gi/go) exp(-T*/T s ), (3) 

where (31/50) = 3 is the ratio of the statistical degeneracy factors of the two levels, 
and T* = hc/kX 2 i cm = 0.068 K. 

With this definition, the optical depth of a cloud of hydrogen is then 



ds [1 - exv{-E w /k B T s )\(Jo(t>{v)n , 



(4) 



where no — tih with tih being the hydrogen density, and we have denoted the 21 cm 
cross-section as a{v) = ao<f>(y), with a = 3c 2 A w / '8tti/ 2 , where ^4io = 2.85 x 10~ 15 s -1 
is the spontaneous decay rate of the spin-flip transition, and the line profile is 
normalised so that J <f>(y)dv =1. To evaluate this expression we need to find the 
column length as a function of frequency s(v) to determine the range of frequencies 
dv over the path ds that correspond to a fixed observed frequency fobs- This can be 
done in one of two ways: by relating the path length to the cosmological expansion 
ds = — cdz/(l + z)H(z) and the redshifting of light to relate the observed and 
emitted frequencies v y JS = v era /(l + z) or assuming a linear velocity profile locally 
v = (dv/ds)s (the well known Sobolev approximation |14j ) and using the Doppler 
law fobs = ^cm(l — v/c) self-consistently to 0(v/c). Since the latter case describes 
the well known Hubble law in the absence of peculiar velocities these two approaches 
give identical results for the optical depth. The latter picture brings out the effect of 
peculiar velocities that modify the local velocity-frequency conversion. 

The optical depth of this transition is small at all relevant redshifts, yielding a 
differential brightness temperature 



ST h = 



T s - T R 



1 



■ z 

T R 



(1-e— ) 



1 



z 



27 x m (1 + S b 

'Ts-Tr 
T s 



0.023 



d r v r 



0.15 1 + z 
fl m h 2 10 



mK, 



1/2 



(5) 
(6) 

(7) 



_{l + z)H(z) 

Here £hi is the neutral fraction of hydrogen, 5), is the fractional overdensity in baryons, 
and the final term arises from the velocity gradient along the line of sight d r v r . 

The key to the detectability of the 21 cm signal hinges on the spin temperature. 
Only if this temperature deviates from the background temperature, will a signal be 
observable. Much of this review will focus on the physics that determines the spin 
temperature and how spatial variation in the spin temperature conveys information 
about astrophysical sources. 

Three processes determine the spin temperature: (i) absorption/emission of 21 cm 
photons from/to the radio background, primarily the CMB; (ii) collisions with other 
hydrogen atoms and with electrons; and (Hi) resonant scattering of Lya photons that 
cause a spin flip via an intermediate excited state. The rate of these processes is fast 
compared to the de-excitation time of the line, so that to a very good approximation 
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the spin temperature is given by the equilibrium balance of these effects. In this limit, 
the spin temperature is given by |15j 

ji-l _ ^7 1 + x gT a 1 + x c T K 
f -\- x a ~t~ x c 

where T 7 is the temperature of the surrounding bath of radio photons, typically set 
by the CMB so that T 7 = Tomb! T a is the color temperature of the Lya radiation 
field at the Lya frequency and is closely coupled to the gas kinetic temperature Tk 
by recoil during repeated scattering, and X q . X qi cLrG the coupling coefficients due to 
atomic collisions and scattering of Lya photons, respectively. The spin temperature 
becomes strongly coupled to the gas temperature when cctot = x c + x a > 1 and relaxes 
to T 7 when x tot <C 1. 

Two types of background radio sources are important for the 21 cm line as a 
probe of astrophysics. Firstly, we may use the CMB as a radio background source. 
In this case, Tr = Tcmb and the 21 cm feature is seen as a spectral distortion to 
the CMB blackbody at appropriate radio frequencies (since fluctuations in the CMB 
temperature are small STqmb ~ 10~ 5 the CMB is effectively a source of uniform 
brightness) . The distortion forms a diffuse background that can be studied across the 
whole sky in a similar way to CMB anisotropies. Observations at different frequencies 
probe different spherical shells of the observable Universe, so that a 3D map can be 
constructed. This is the main subject of Q 

The second situation uses a radio loud point source, for example a radio loud 
quasar, as the background. In this case, the source will always be much brighter 
than the weak emission from diffuse hydrogen gas, Tr 3> Ts, so that the gas is seen 
in absorption against the source. The appearance of lines from regions of neutral 
gas at different distances to the source leads to a "forest" of lines known as the "21 
cm forest" in analogy to the Lya forest. The high brightness of the background 
source allows the 21 cm forest to be studied with high frequency resolution so probing 
small scale structures (~ kpc) in the IGM. For useful statistics, many lines of sight 
to different radio sources are required, making the discovery of high rcdshift radio 
sources a priority. We leave discussion of the 21 cm forest to S|6] 

Note that we have a number of different quantities with units of temperature, 
many of which are not true thermodynamic temperatures. Tr and ST^ are measures 
of a radio intensity. Ts measures the relative occupation numbers of the two hyperfinc 
levels. T a is a colour temperature describing the photon distribution in the vicinity of 
the Lya transition. Only the CMB blackbody temperature Tcmb and Tk are genuine 
thermodynamic temperatures. 

2.2. Collisional coupling 

Collisions between different particles may induce spin-flips in a hydrogen atom and 
dominate the coupling in the early Universe where the gas density is high. Three main 
channels are available: collisions between two hydrogen atoms and collisions between 
a hydrogen atom and an electron or a proton. The collisional coupling for a species i 

is [1313] 

x i _ Cw_ T\ _ njK\ 7\ 
Aio T 7 Aio T 7 

where Cio is the collisional excitation rate, k\ is the specific rate coefficient for spin 
deexcitation by collisions with species i (in units of cm 3 s -1 ). 
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The total collisional coupling coefficient can be written as 

HH 

c 



«f_^(T fe )n ff + K ? ff (T fe K + < H (r fe )n p , (10) 

where is the scattering rate between hydrogen atoms, Kfz is the scattering 

rate between electrons and hydrogen atoms, and k^ is the scattering rate between 
protons and hydrogen atoms. 

The collisional rates require a quantum mechanical calculation. Values for 
have been tabulated as a function of [THl E] , the scattering rate between electrons 
and hydrogen atoms was considered in Ref. [18], and the scattering rate 

between protons and hydrogen atoms was considered in Ref. [19] . Useful fitting 
functions exist for these scattering rates: the HH scattering rate is well fit in the 
range 10K < T K < 10 3 K by k h J* (T k ) 3.1 x lCT 11 ^ 357 exp(-32/T K ) cm 3 s" 1 [20]; 
and the e-H scattering rate is well fit by log(K^ /cm 3 s _1 ) = —9.607 + 0.5 log Tr- x 
exp[-(logr x ) 4 - 5 / 1 800] for T < 10 4 K and k\%{T k > 10 4 K) = K f H (10 4 K) [21]. 

During the cosmic dark ages, where the coupling is dominated by collisional 
coupling the details of the process become important. For example, the above 
calculations make use of the assumption that the collisional cross-sections are 
independent of velocity; the actual velocity dependance leads to a non-thermal 
distribution for the hyperfine occupation 22 . This effect can lead to a suppression of 
the 21 cm signal at the level of 5%, which although small is still important from the 
perspective of using the 21 cm signal from the dark ages for precision cosmology. 

2.3. Wouthuy sen- Field effect 

For most of the redshifts that are likely to be observationally probed in the near future 
collisional coupling of the 21 cm line is inefficient. However, once star formation begins, 
resonant scattering of Lya photons provides a second channel for coupling. This 
process is generally known as the Wouthuysen-Field effect [331 US] and is illustrated 
in Figure [2] which shows the hyperfine structure of the hydrogen IS and 2P levels. 
Suppose that hydrogen is initially in the hyperfine singlet state. Absorption of a Lya 
photon will excite the atom into either of the central 2P hyperfine states (the dipole 
selection rules AF = 0, 1 and no F = — > transitions make the other two hyperfine 
levels inaccessible) . From here emission of a Lya photon can relax the atom to either 
of the two ground state hyperfine levels. If relaxation takes the atom to the ground 
level triplet state then a spin-flip has occurred. Hence, resonant scattering of Lya 
photons can produce a spin-flip. 

The physics of the Wouthuysen-Field effect is considerably more subtle than this 
simple description would suggest. We may write the coupling as 

a_ 27A 10 T 7 ' 1 j 

where P a is the scattering rate of Lya photons. Here we have related the scattering 
rate between the two hyperfine levels to P a using the relation Pqi — 4P a /27, which 
results from the atomic physics of the hyperfine lines and assumes that the radiation 
field is constant across them [25] . 

The rate at which Lya photons scatter from a hydrogen atom is given by 



P a = 4:TT Xa / dv J v (v)4> a {y), (12) 
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Figure 2. Left panel: Hyperfine structure of the hydrogen atom and the 
transitions relevant for the Wouthuysen-Field effect 24.. Solid line transitions 
allow spin flips, while dashed transitions are allowed but do not contribute to 
spin flips. Right panel: Illustration of how atomic cascades convert Lyn photons 
into Lya photons. 



where a v = Xa ( t'a.{ v ) is the local absorption cross section, \a = ( 7r e 2 /m e c)/ a is the 
oscillation strength of the Lya transition, <j) a {v) is the Lya absorption profile, and 
J v (v) is the angle-averaged specific intensity of the background radiation field (by 
number) . 

Making use of this expression, we can express the coupling as 

_ iW/ a 

27AiQ± 1 m e c 

where J a is the specific flux evaluated at the Lya frequency. Here we have introduced 
S a = J dxcj) a (x) J u (x)l Joo, with being the flux away from the absorption feature, 
as a correction factor of order unity to describe the detailed structure of the photon 
distribution in the neighborhood of the Lya resonance. 

Equation ( 13 1 can b e u sed to calculate the critical flux required to produce 
x a = S a . We rewrite ^ as x a = S a J a /J% where J% = 1.165 x 10 10 [(1 + 
z)/20] cm~ 2 s _1 Hz -1 sr _1 . The critical flux can also be expressed in terms of the 
number of Lya photons per hydrogen atom J^/nn — 0.0767[(l+z)/20]~ 2 . In practice, 
this condition is easy to satisfy once star formation begins. 

The above physics couples the spin temperature to the colour temperature of the 
radiation field, which is a measure of the shape of the radiation field as a function of 
frequency in the neighbourhood of the Lya line defined by |26j 

h dlogn„ 

k B T c Av ' (LV 

where n v — c 2 J v /2v 2 is the photon occupation number. Some care must to taken 
with this definition; other definitions that do not obey detailed balance can be found 
in the literature. 

Typically, Tc ~ Tr-, because in most cases of interest the optical depth to Lya 
scattering is very large leading to a large number of scatterings of Lya photons that 
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bring the radiation field and the gas into local equilibrium for frequencies near the 
line center [57]. At the level of microphysics this relation occurs through the process 
of scattering Lya photons in the neighbourhood of the Lya resonance, which leads 
to a distinct feature in the frequency distribution of photons. Without going into 
the details, one can understand the formation of this feature in terms of the "flow" of 
photons in frequency. Redshifting with the cosmic expansion leads to a flow of photons 
from high to low frequency at a fixed rate. As photons flow into the Lya resonance 
they may scatter to larger or smaller frequencies. Since the cross-section is symmetric, 
one would expect the net flow rate to be preserved. However, each time a Lya photon 
scatters from a hydrogen atom it will lose a fraction of its energy hv /m v c 2 due to the 
recoil of the atom. This loss of energy increases the flow to lower energy and leads to a 
deficit of photons close to line center. As this feature develops scattering redistributes 
photons leading to an asymmetry about the line. This asymmetry is exactly that 
required to bring the distribution into local thermal equilibrium with Tq ~ Tk- 

The shape of this feature determines S a and, since recoils source an absorption 
feature, ensures S a < 1. At low temperatures, recoils have more of an effect and the 
suppression of the Wouthuysen-Field effect is most pronounced. If the IGM is warm 
then this suppression becomes negligible [UJ I2H1 133 122 ■ The above discussion has 
neglected processes whereby the distribution of photons is changed by spin-exchanges. 
Including this complicates the determination of T5 and Tq considerably since they 
must then be iterated to find a self-consistent solution for the level- and photon- 
populations [29]. However, the effect of spin-flips on the photon distribution is small 
< 10%. 

A useful approximation for S a is outlined in Ref. 30|: S a ~ exp(— 1.79a), where 
a = ?](3a/2nj) 1 / 3 , a = T/(4:1tAvd), T the inverse lifetime of the upper 21 cm level, 
Avd/vq — (2ksTK /mc 2 ) 1 / 2 is the Doppler parameter, vq the line center frequency, 
7 = Tqp, and rj = {hv 2 ) / \mc 2 Avn) is the mean frequency drift per scattering due 
to recoil, which is accurate at the 5% level provided that Tk > 1 K and the Gunn- 
Peterson optical depth tq P is large. 

In the astrophysical context, we will primarily be interested in photons redshifting 
into the Lya resonance from frequencies below the Ly/3 resonance. In addition, Lya 
photons can be produced by atomic cascades from photons redshifting into higher 
Lyman series resonances. These atomic cascades are illustrated in Figure [2] where 
the probability of converting a Lyn photon into a Lya photon is set by the atomic 
rate coefficients and can be found in tabular form in Refs. ^Z§\ [Mj. For large n, 
approximately 30% conversion is typical. These photons are injected into the Lya line 
rather than being redshifted from outside of the line. This changes their contribution 
to the Wouthuysen-Field coupling since the photon distribution is now one-sided. 
Similar processes to those described above apply to the redistribution of these photons, 
and they can lead to an important amplification of the Lya flux. 

This discussion gives a sense of some of the subtleties that go into to determining 
the strength of the Lya coupling. These effects can modify the 21 cm signal at the 
~10% level, which will be important as observations begin to detect 21 cm fluctuations. 
At this stage, it appears that the underlying atomic physics is understood, although 
the details of Lya radiative transfer still requires some work. 
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3. Global 21 cm signature 

3.1. Outline 

Next we examine the cosmological context of the 21 cm signal. We may express the 
21 cm brightness temperature as a function of four variables Tj = T^{Tk, x%, Ja,nn), 
where Xi is the volume-averaged ionized fraction of hydrogen. In calculating the 21 
cm signal, we require a model for the global evolution of and fluctuations in these 
quantities. Before looking at the evolution of the signal quantitatively, we will first 
outline the basic picture to delineate the most important phases. 

An important feature of Tj, is that its dependence on each of these quantities 
saturates at some point, for example once the Lya flux is high enough the spin and 
kinetic gas temperatures become tightly coupled and further variation in J a becomes 
irrelevant to the details of the signal. This leads to conceptually separate regimes 
where variation in only one of the variables dominating fluctuations in the signal. 
These different regimes can be seen in Figure [T] and are shown in schematic form in 
Figure [3] for clarity. We now discuss each of these phases in turn. 




Figure 3. Cartoon of the different phases of the 21 cm signal. The signal 
transitions from an early phase of collisional coupling to a later phase of Lya 
coupling through a short period where there is little signal. Fluctuations after 
this phase are dominated successively by spatial variation in the Lya , X-ray, 
and ionizing UV radiation backgrounds. After reionization is complete there is a 
residual signal from neutral hydrogen in galaxies. 



• 200 < z < 1100: The residual free electron fraction left after recombination 
allows Compton scattering to maintain thermal coupling of the gas to the CMB, 
setting Tk — T 1 . The high gas density leads to effective collisional coupling so 
that T$ — T-y and we expect Tf, = and no detectable 21 cm signal. 

• 40 < z < 200: In this regime, the gas cools adiabatically so that Tk oc (1 + z) 2 
leading to Tk < T 7 and collisional coupling sets T$ < T 7 , leading to TJ, < and 
an early absorption signal. At this time, Xj, fluctuations are sourced by density 
fluctuations, potentially allowing the initial conditions to be probed [3"2l |2"2"] . 

• z* < z < 40: As the expansion continues, decreasing the gas density, collisional 
coupling becomes ineffective and radiative coupling to the CMB sets T$ = T~, 
and there is no detectable 21 cm signal. 

• z Q < z < z*: Once the first sources switch on at z+, they emit both Lya photons 
and X-rays. In general, the emissivity required for Lya coupling is significantly 
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less than that for heating Tk above T 7 . We therefore expect a regime where 
the spin temperature is coupled to cold gas so that Tg ~ Tk < T 7 and there 
is an absorption signal. Fluctuations are dominated by density fluctuations and 
variation in the Lya flux [33J I2H EH] • As further star formation occurs the Lya 
coupling will eventually saturate (x a ^> 1), so that by a redshift z a the gas will 
everywhere be strongly coupled. 

• Zh < z < z Q : After Lya coupling saturates, fluctuations in the Lya flux no longer 
affect the 21 cm signal. By this point, heating becomes significant and gas 
temperature fluctuations source T\, fluctuations. While Tk remains below T 7 
we see a 21 cm signal in absorption, but as Tk approaches T 7 hotter regions may 
begin to be seen in emission. Eventually by a redshift Zh the gas will be heated 
everywhere so that Tk = T 7 . 

• zt < z < Zh: After the heating transition, Tk > T 1 and we expect to see a 21 
cm signal in emission. The 21 cm brightness temperature is not yet saturated, 
which occurs at Zt, when T$ ~ Tk 3> T 7 . By this time, the ionization fraction 
has likely risen above the percent level. Brightness temperature fluctuations are 
sourced by a mixture of fluctuations in ionization, density and gas temperature. 

• z r < z < zt: Continued heating drives Tk S> T" 7 at zt and temperature 
fluctuations become unimportant. T$ ~ Tk S> T 7 and the dependence on T$ 
may be neglected in equation |7|, which greatly simplifies analysis of the 21 cm 
power spectrum |35) . By this point, the filling fraction of HII regions probably 
becomes significant and ionization fluctuations begin to dominate the 21 cm signal 

m- 

• z < z r : After reionization, any remaining 21 cm signal originates primarily from 
collapsed islands of neutral hydrogen (damped Lya systems). 

Most of these epochs are not sharply defined, and so there could be considerable 
overlap between them. In fact, our ignorance of early sources is such that we can not 
definitively be sure of the sequence of events. The above sequence of events seems 
most likely and can be justified on the basis of the relative energetics of the different 
processes and the probable properties of the sources. We will discuss this in more 
detail as we quantify the evolution of the sources. 

Perhaps the largest uncertainty lies in the ordering of z a and Zh- Ref. [37] explores 
the possibility that Zh > z a , so that X-ray preheating allows collisional coupling to be 
important before the Lya flux becomes significant. Simulations of the very first mini- 
quasar [351 HO] also probe this regime and show that the first luminous X-ray sources 
can have a great impact on their surrounding environment. We note that these studies 
ignored Lya coupling, and that an X-ray background may generate significant Lya 



photons [34], as we discuss in { 3.5 Additionally, while these authors looked at the case 
where the production of Lya photons was inefficient, one can consider the case where 
heating is much more efficient. This can be the case where weak shocks raise the IGM 
temperature very early on |39| or if exotic particle physics mechanisms such as dark 
matter annihilation are important. Clearly, there is still considerable uncertainty in 
the exact evolution of the signal making the potential implications of measuring the 
21 cm signal very exciting. 
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3.2. Evolution of global signal 

Having outlined the evolution of the signal qualitatively, we will turn to the details 
of making quantitative predictions. In calculating the 21 cm signal it will help us to 
treat the IGM as a two phase medium. Initially, the IGM is composed of a single 
mostly neutral phase left over after recombination. This phase is characterised by a 
gas temperature Tk and a small fraction of free electrons x e . This is the phase that 
generates the 21 cm signal. 

Once galaxy formation begins, energetic UV photons ionize HII regions 
surrounding, first individual galaxies and then clusters of galaxies. These UV photons 
have a very short mean free path in a neutral medium leading to the ionized HII regions 
having a very sharp boundary (although the boundary can be softened if the ionizing 
photons are particularly hard [10]). We may therefore treat the ionized HII bubbles as 
a second phase in the IGM characterised by a volume filling fraction Xi (provided that 
the free electron fraction is small xi is approximately the mean ionization fraction). 
We will assume that these bubbles are fully ionized and that the temperature inside the 
bubbles is fixed at Thii = 10 4 K determining the collisional recombination rate inside 
these bubbles. Since the photons that redshift into the Lya resonance initially have 
long mean free paths, we may treat the Lya flux, J a as being the same in both phases 
(although in practice, since there is no 21 cm signal from the fully ionized bubbles, it 
is only the Lya flux in the mostly neutral phase that matters). To determine the 21 
cm signal at a given redshift, we must calculate the four quantities X4, x e , Tk, and 
J a . We begin by describing the evolution of the gas temperature Tk, 



Here, the first term accounts for adiabatic cooling of the gas due to the cosmic 
expansion while the second term accounts for other sources of heating/cooling j with 
Cj the heating rate per unit volume for the process j. 



Next, we consider the volume filling fraction Xi and the ionization of the neutral 
IGM x e 



In these expressions, we define Aj to be the rate of production of ionizing photons 
per unit time per baryon applied to HII regions, A e is the equivalent quantity in the 
bulk of the IGM, a a — 4.2 x 10~ 13 cm 3 s _1 is the case-A recombination coefficient at 
T = 10 4 K, as(T) is the case-B recombination rate (whose temperature dependence 
can be obtained from [41]), and C = {n 2 ) / {n e ) 2 is the clumping factor. 

Superficially these look the same, since in each case the ionization rate is a balance 
between ionizations and recombinations. The main distinction lies in the manner 
in which we treat the recombinations. In the fully ionized bubbles, recombinations 
occur in those dense clumps of material capable of self-shielding against ionizing 
radiation. These overdense regions will have a locally enhanced recombination 
rate, making it important to account for the inhomogeneous distribution of matter 
through the clumping factor C. Since recombinations will occur on the edge of these 
neutral clumps, secondary photons produced by the recombinations will likely be 




(15) 



dxi 
^dT 
dx e 
^dT 



(1 - x e )Ai - a A C'x 2 n H , 



(1 - x e )A e - a B {T)x 2 e n H - 



(16) 



(17) 
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absorbed inside the clumps rather than in the mean IGM, justifying the use of case- 
A recombination [32] • In contrast, recombinations in the bulk of the neutral IGM 
will occur at close to mean density in gas with temperature Tk- Here recombination 
radiation will be absorbed in the IGM, so we must use case-B recombination. By 
keeping track of this carefully our evolution matches onto that of RECFAST [3T] . 

This two phase approximation will eventually break down should x e become close 
to unity, indicating that most of the IGM has been ionized and that there is no clear 
distinction between ionized bubbles and a neutral bulk IGM. In most of our models, 
x e remains small until the end of reionization making this a reasonable approximation. 



3.3. Growth of HII regions 

The growth of ionized HII regions is governed by the interplay between ionization and 
recombination, both of which contain considerable uncertainties. We may write the 
ionization rate per hydrogen atom as 

A* = Ahc/cscAW^z), (18) 

with N- lon being the number of ionizing photons per baryon produced in stars, / esc the 
fraction of ionizing photons that escape the host halo, and A^ c a correction factor for 
the presence of Helium. Here p*(z) is the star formation rate density as a function 
of redshift, which is still poorly known observationally. For a present-day initial mass 
function of stars, A; on ~ 4 x 10 3 , whereas for very massive (> 1O 2 M ) stars of 
primordial composition, Ai on ~ 10 5 [43] [44] . 

We model the star formation rate as tracking the collapse of matter, so that we 
may write the star formation rate per (comoving) unit volume 

P*(z) = $/*~/coh(z). (19) 

where p® is the cosmic mean baryon density today and /+ is the fraction of baryons 
converted into stars. This formalism is appropriate for z > 10, as at later times star 
formation as a result of mergers becomes important. 

With these assumptions, we may rewrite the ionization rate per hydrogen atom 

as 

A, = CW^f, (20) 

where /colics) is the fraction of gas inside collapsed objects at z and the ionization 
efficiency parameter £ is given by 

C - A Ho /*/cscA ion . (21) 

This model for x% is motivated by a picture of HII regions expanding into neutral 
hydrogen (45) . In calculating / co ii, we use the Sheth-Tormen [46] mass function 
An I Am and determine a minimum mass m m i n for collapse by requiring the virial 
temperature r v ; r > 10 4 K, appropriate for cooling by atomic hydrogen. Decreasing 
this minimum galaxy mass, say to the virial temperature corresponding to molecular 
hydrogen cooling (~ 300 K), will allow star formation to occur at earlier times, shifting 
the features that we describe in redshift. 

The sources of ionizing photons in the early Universe are believed to have been 
primarily galaxies. However, the properties of these galaxies are currently only poorly 
constrained. Recent observations with the Hubble Space Telescope provide some 
of the best constraints on early galaxy formation. Faint galaxies are identified as 
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being at high rcdshift using a "Lya dropout technique" where a naturally occurring 
break in the galaxy spectrum at the Lya wavelength 1216A is seen in different colour 
filters as a galaxy is redshifted. So far, galaxies at redshifts up to z ~ 10 have been 
found providing information on the sources of reionization. There are unfortunately 
considerable limitations on the existing surveys owing to their small sky coverage, 
which makes it unclear whether those galaxies seen are properly representative, and the 
limited frequency coverage. Even more problematic for our purposes is that the optical 
frequencies at which the galaxies are seen do not correspond to the UV photons that 
ionize the IGM. Our limited understanding of the mass distribution of the emitting 
stars introduces an uncertainty in the number of ionizing photons per baryon AT ion is 
emitted by galaxies. There is also considerable uncertainty in the fraction of ionizing 
photons /esc that escape the host galaxy to ionize the IGM. 

The recombination rate is primarily important at late times once a significant 
fraction of the volume has already been ionized. At this stage, dense clumps within 
an ionized bubble can act as sinks of ionizing photons slowing or even stalling further 
expansion of the bubble. The degree to which gas resides in these dense clumps is 
an important uncertainty in modelling reionization. Important hydrodynamic effects, 
such as the evaporation of gas from a halo as a result of photoionization heating |47j . 
can significantly modify the clumping factor. 

A simple model for the clumping factor [32] assumes that the Universe will be 
fully ionized up to some critical overdensity A c . If the probability distribution for the 
gas density -FV(A) is specified, we may then write the clumping factor as 
f A c 

C= dAA 2 P y (A). (22) 

The quantity P^(A) can be modeled analytically starting from a consideration of 
behaviour of low density voids and accounting for Gaussian initial conditions. The 
analytic form resembles that of a Gaussian with a power law tail |42j and can be 
measured from simulations [48] . To accurately capture the clumping one should self- 
consistently perform a full hydrodynamical simulation of reionization, since thermal 
feedback can modify the gas density distribution [47] . 

To set the critical density A c , we account for the patchy nature of reionization, 
which proceeds via the expansion and overlap of ionized bubbles [49] , The size of a 
bubbles will become limited if the mean free path of ionized photons becomes shorter 
than the size of the bubble, for example if the bubble contains many small self-shielded 
absorbers. The mean free path of ionizing photons can be related to the underlying 
density field as 

K = \ [l-F v (A t )}- 2/3 . (23) 

Here Ao is an unknown normalisation constant that was found by Ref. |42j in the 
context of simulations at z — 2 — 4 to be well fit by XoH(z) — 60kms _1 . This scaling 
relationship is likely to be very approximate, but we make use of it for convenience. 
With this we can fix Aj within an ionized bubble by setting the relevant Rb = Aj(A c ). 
We then average the clumping factor over the distribution of bubble sizes (discussed 
in more details later) to get the mean clumping factor. 

3.4- Heating and ionization 



To determine the heating rate, we must integrate equation ( 15 ) and therefore we must 
specify which heating mechanisms are important. At high redshifts, the dominant 
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mechanism is Compton heating of the gas arising from the scattering of CMB photons 
from the small residual free electron fraction. Since these free electrons scatter readily 
from the surrounding baryons this transfers energy from the CMB to the gas. Compton 
heating serves to couple Tk to T 7 at redshifts z > 150, but becomes ineffective below 
that redshift. In our context, it serves to set the initial conditions before star formation 
begins. The heating rate per particle for Compton heating is given by [50] 



2 ^compton _ *£e T*y Tj( liy 



(1 + ^) 4 , (24) 



= =8.55x 10- 13 yr _1 - (25) 



3 ksn 1 + /iie + x e tj u, 7 
where /h c is the helium fraction (by number), u 7 is the energy density of the CMB, 
<tt = 6.65 x 10 _25 cm 2 is the Thomson cross-section, and we define 

m i a ? — Q KK w m-13. 
3m e c 

At lower redshifts, the growth of non-linear structures leads to other possible 
sources of heat. Shocks associated with large scale structure occur as gas separates 
from the Hubble flow and undergoes turnaround before collapsing onto a central 
overdensity. After turnaround different fluid elements may cross and shock due to the 
differential accelerations. Such turnaround shocks could provide considerable heating 
of the gas at late times [39] . 

Another source of heating is the scattering of Lya photons off hydrogen atoms, 
which leads to a slight recoil of the nucleus that saps energy from the photon. It 
was initially believed that this would provide a strong source of heating sufficient to 
prevent the possibility of seeing the 21 cm signal in absorption. Early calculations 
showed that by the time the scattering rate required for Lya photons to couple the 
spin and gas temperatures was reached, the gas would have been heated well above 
the CMB temperature [5T]. These early estimates, however, did not account for the 
way the distribution of Lya photon energies was changed by scattering. This spectral 
distortion is a part of the photons coming into equilibrium with the gas and serves to 
greatly reduce the heating rate [HI 28, 30, 31 . While Lya heating can be important 
it typically requires very large Lya fluxes and so is most relevant at late times and 
may be insufficient to heat the gas to the CMB temperature alone. 

The most important source of energy injection into the IGM is likely via X-ray 
heating of the gas [52 , 28 , 53 , 54 . While shock heating dominates the thermal balance 
in the present day Universe, during the epoch we are considering they heat the gas 
only slightly before X-ray heating dominates. For sensible source populations, Lya 
heating is mostly negligible compared to X-ray heating [3U [55] . 

Since X-ray photons have a long mean free path, they are able to heat the gas 
far from the source, and can be produced in large quantities once compact objects are 
formed. The comoving mean free path of an X-ray with energy E is [3] 

(l±£)- s (_|_) s „ pc . (26) 

Thus, the Universe will be optically thick over a Hubble length to all photons with 
energy below E ~ 2[(1 + z)/15] 1 / 2 a;J/ I 3 kcV. The E~ 3 dependence of the cross-section 
means that heating is dominated by soft X-rays, which fluctuate on small scales. In 
addition, though, there will be a uniform component to the heating from harder X- 
rays. 

X-rays heat the gas primarily through photo-ionization of HI and Hel: this 
generates energetic photo-electrons, which dissipate their energy into heating, 
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secondary ionizations, and atomic excitation. With this in mind, we calculate the 
total rate of energy deposition per unit volume as 



ex = 4%y]ni I dva v .i J v {hv - hvth,i), (27) 



where we sum over the species i =HI, Hel, and Hell, rii is the number density of 
species i, hvth = E t h is the threshold energy for ionization, ov j is the cross-section for 
photoionization, and J„ is the number flux of photons of frequency v. 

We may divide this energy into heating, ionization, and excitation by inserting 
the factor fi(y, x e ), defined as the fraction of energy converted into form i at a specific 
frequency. This allows us to calculate the contribution of X-rays to both the heating 
and the partial ionization of the bulk IGM. The relevant division of the X-ray energy 
depends on both the X-ray energy E and the free electron fraction x e and can be 
calculated by Monte-Carlo methods. This partitioning of X-ray energy in this way 
was first calculated by Ref . [SS] and subsequently updated [57] [58] . In the following 
calculations, we make use of fitting formula for the f%{v) calculated by Ref. |56| . 
which are approximately independent of v for hv > lOOeV, so that the ionization rate 
is related to the heating rate by a factor /ion /(/heat -Eth) • 

The X-ray number flux is found from 



Jx{z)= I AvJ x (v,z), (28) 
• [** , (1 + z) 2 c . _ T 

where ex(v,z) is the comoving photon emissivity for X-ray sources, and v' is the 
emission frequency at z' corresponding to an X-ray frequency v at z 

The optical depth is given by 

f z ' dl 

T(V, Z, z') = J j^dz" [n H lCrHl(^")+ n HolO'Hcl(^")+"-HcIlO'HoIl(l'")]: ( 30 ) 

where we calculate the cross-sections using the fits of [55]. Care must be taken here, as 
the cross-sections have a strong frequency dependence and the X-ray photon frequency 
can redshift considerably between emission and absorption. In practice, the abundance 
of Hell may be neglected [55] . 

X-rays may be produced by a variety of different sources with three main 
candidates at high redshifts being identified as starburst galaxies, supernova remnants 
(SNR), and miniquasars [HOI EH E3- Galaxies with high rates of star formation 
produce copious numbers of X-ray binaries, whose total X-ray luminosity can be 
considerable. Two populations of X-ray binaries may be identified in the local Universe 
distinguished by the mass of the donor star which feeds its black hole companion - 
low-mass X-ray binaries (LMXB) and high-mass X-ray binaries (HMXB). The short 
life time of HMXBs (i H MXB ~ 10 7 yr) leads the X-ray luminosity L^ MXB to track the 
star formation rate. At the same time, the longer lived LMXB (£lmxb ~ 10 10 yr) 
tracks the total mass of stars formed. Since we will focus on the early Universe and 
on the first billion years of evolution, when few LMXB are expected to have formed, 
the dominant contribution to Lx in galaxies is likely to be from HMXB 631 ■ This has 



CONTENTS 



19 



conventionally been defined in terms of a parameter fx such that the emissivity per 
unit (comoving) volume per unit frequency 



where is the star formation rate density, and the spectral distribution function is a 
power law with index as 



and the pivot energy Iii/q = 1 keV. We assume emission within the band 0.2 - 30 keV, 
and set Lq = 3.4 x 10 40 /x erg s _1 Mpc~ 3 , where fx is a highly uncertain constant 
factor [62 j . For a spectral index as — 1-5, roughly corresponding to that for starburst 
galaxies, fx = 1 corresponds to the emission of approximately 560cV for every baryon 
converted into stars. 

This normalisation was chosen so that, with fx = 1, the total X-ray luminosity 
per unit star formation rate (SFR) is consistent with that observed in starburst 
galaxies at the present epoch [631 164| . Since then improved observations have revised 
this figure owing to better separation of the contribution from LMXB and HMXB. 
While the data is still as of yet fairly patchy and shows considerable scatter fx ~ 0.2 
seems a better fit to the most recent data in the local universe [B3 E53 ■ Extrapolating 
observations from the present day to high redshift is fraught with uncertainty, and we 
note that this normalisation is very uncertain and probably evolves with redshift [67] . 
In particular, the metallicity evolution of galaxies with redshift is likely to impact 
the ratio of black holes to neutron stars that form the compact object in the HMXB 
and with it the efficiency of X-ray production. Additionally, the fraction of stars in 
binaries may evolve with redshift and is only poorly constrained at high redshifts [68] . 

Other sources of X-rays are inverse Compton scattering of CMB photons from the 
energetic electrons in supernova remnants. Estimates of the luminosity of such sources 
is again highly uncertain, but of a similar order of magnitude as from HMXB 60 . Like 
HMXB, the X-ray luminosity from supernovae remnants is expected to track the star 
formation rate. Finally, miniquasars - accretion onto black holes with intermediate 
masses in the range 10 1_5 M Q - can produce significant levels of X-rays. Since the early 
formation of black holes depends sensitively on the source of seed black holes and their 
subsequent merger history there is again considerable uncertainty. For simplicity, we 
will assume that miniquasars similarly track the star formation rate (SFR) . In reality, 
of course, their evolution could be considerably more complex [5^1 EH] • 

The total X-ray luminosity at high redshift is constrained by observations of the 
present day unresolved soft X-ray background (SXRB). An early population of X-ray 
sources would produce hard X-rays that would redshift to lower energies contributing 
to this background. Since there will be faint X-ray sources at lower redshift that also 
contribute to this background, the SXRB can be used to place a conservative upper 
limit on the amount of X-ray production at early times. This rules out complete 
reionization by X-rays but allows considerable latitude for heating [71j . 

Since heating requires considerably less energy than ionization, fx is still 
relatively unconstrained with values as high as fx ;$ 10 3 possible without violating 
constraints from the CMB polarisation anisotropies on the optical depth for electron 
scattering. Constraining this parameter will mark a step forward in our understanding 
of the thermal history of the IGM and the population of X-ray sources at high redshifts. 




(31) 




(32) 
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3. 5. Coupling 

Finally, we need to specify the evolution of the Lya flux. This is produced by stellar 
emission (J a ,*) and by X-ray excitation of HI (J a ,x)- Photons emitted by stars, 
between Lya and the Lyman limit, will redshift until they enter a Lyman series 
resonance. Subsequently, they may generate Lya photons via atomic cascades [24U29] . 
The Lya flux from stars J a ^ arises from a sum over the Lyn levels, with the maximum 
n that contributes n max ~ 23 determined by the size of the HII region of a typical 
(isolated) galaxy (see [33J for details). The average Lya background is then 

™max 

J Q »= ]T J<>*>(*), (33) 

n=2 

= 2^ Recycle (n) / dz — — — -^7ye*K,2), 



n=2 



where z max (n) is the maximum redshift from which emitted photons will redshift into 
the level n Lyman resonance, v' n is the emission frequency at z' corresponding to 
absorption by the level n at z, /recycle (n) is the probability of producing a Lya photon 
by cascade from level n, and e*(f, z) is the comoving photon number emissivity for 
stellar sources. We connect e*(z/, z) to the star formation rate in the same way as for 



X-rays in equation (32), and define e*(v) to be the spectral distribution function of 
the stellar sources. 

Stellar sources typically have a spectrum that falls rapidly above the Ly/3 
transition. We consider models with present-day (Pop. I & II) and very massive 
(Pop. Ill) stars. In each case, we take e*(v) to be a broken power law with one index 
describing emission between Lya and Ly/3, and a second describing emission between 
Ly/3 and the Lyman limit (see [21] for details). The details of the signal depend 
primarily on the total Lya emissivity and not on the shape of the spectrum (but see 
[72 [S3] for details of how precision measurements of the 21 cm fluctuations might say 
something about the source spectrum). 

For convenience, we define a parameter controlling the normalisation of the Lya 
emissivity f a by setting the total number of Lya photons emitted per baryon converted 
into stars as N a = f a N a ^ Te { where we take the reference values appropriate for normal 
(so-called, Population I & IT) stars iV a ,ref = 6590 [73J[33]. For comparison, in this 
notation, the very massive (Population IIP) stars have [43], N a — 3030 (f a — 0.46), 
when the contribution from higher Lyman series photons is included. We expect the 
value of f a to be close to unity, since stellar properties are relatively well understood. 

Photoionization of HI or Hel by X-rays may also lead to the production of 
Lya photons. In this case, some of the primary photo-electron's energy ends up in 
excitations of HI [53] , which on relaxation may generate Lya photons [5H [311 E2 • The 
rate at which Lya photons are produced ex, a can be calculated in the same way as 
the contribution to heating by X-rays, but with the appropriate change in the fraction 
of the total X-ray energy that goes into excitations rather than heating. 

Calculating the fraction of energy that goes into producing Lya photons is a 
problem that has been considered by a number of authors. Early work. [7H 156) focused 
on the amount of energy that went into atomic excitations as a whole, but we require 
only the fraction that leads to Lya production. Although excitations to the 2P level 
will always generate Lya photons, only some fraction of excitations to other levels 
will lead to Lya generating cascades. The rest will end with two photon decay 
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from the 2S level. This has been addressed by Monte-Carlo simulation of the X- 
ray scattering process using up to date cross-sections in [57] \5E\ . These simulations 
find that around p a ss 0.7 of the total energy that goes into excitation ends up as Lya 
photons, consistent with simple estimates based on the atomic cross-sections [53] . 

This Lya flux J a ^x produced by X-ray excitation may be found by balancing the 
rate at which Lya photons are produced via cascades with the rate at which photons 
redshift out of the Lya resonance [34] , giving 

47T hv a tiv a 

The relative importance of Lya photons from X-rays or directly produced by 
stars is highly dependent upon the nature of the sources that existed at high redshifts. 
Furthermore, it can vary significantly from place to place. In general, X-rays with 
their long mean free path seem likely to dominate the Lya flux far from sources while 
the contribution from stellar sources dominates closer in [34] . 



3.6. Astrophysical sources and histories 

In the above sections, we have outlined the mathematical formalism for describing the 
21 cm signal and have omitted a detailed discussion of the sources. This was deliberate; 
although we have a reasonable understanding of the physical processes involved, our 
knowledge of the properties of early sources of radiation is highly uncertain. 

Many models of galaxy formation assume that the first stars to form from the 
collapse of primordial gas are very massive (~ 10 — 100 M ) population III stars [43] . 
This is predicated on the inference that the absence of coolants more efficient than 
molecular hydrogen leads to monolithic collapse into a single massive star rather than 
fragmentation into many lower mass stars. This assumption has recently begun to be 
challenged by new numerical simulations that use "sink particles" to better follow the 
collapsing gas for many dynamical times. Such simulations show that fragmentation 
into many ~ 0.1 — 1 M Q stars may be the preferred channel of star formation [75j . 
This would naturally explain tentative observations of low mass metal-free stars |76j 
and could lead to a much higher fraction of early X-ray binaries [68 . Once earlier 
generations of star formation has enriched the IGM with metals low mass population 
II stars will begin to form due to more efficient gas cooling [77] . Different predictions 
for the mode of star formation will lead to quite different IGM histories. 

We have three radiation backgrounds to account for - ionizing UV, X-ray, and 
Lyman series photons (identified as those photons with energy 10.2 eV < E < 13.6 eV). 
For each of these radiation fields we must specify a single parameter: the ionization 
efficiency £, the X-ray emissivity fx, and the Lya emissivity f a . These parameters 
enter our model as a factor multiplying the star formation rate and are therefore 
individually degenerate with the star formation efficiency /*. This split provides a 
natural separation between the physics of the sources and the star formation rate 
and, in practice, one might imagine using observations of the star formation rate by 
other means as a way of breaking the degeneracy between them. In addition to these 
parameters, we must specify the minimum mass halo in which galaxies form M m i n and 
make use of the Sheth-Tormen mass function of dark matter halos. 

We now show results for the 21 cm global signal that explore this parameter space 
to give a sense of how the signal depends on these astrophysical parameters. Model A 
uses (iVion.iGM, fa, fx, f* ) = (200,1,1,0.1) giving z roion = 6.47 and r = 0.063. Model 
B uses (Mo„,igm, /a, fx, /*) = (600,1,0.1,0.2) giving z rcion = 9.76 and r = 0.094. 
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Model C uses (Mo„,igm, fa, fx, /*) = (3000,0.46,1,0.15) giving z roion = 11.76 and 
t = 0.115. 
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Figure 4. Top panel: Evolution of the CMB temperature Tcmb (dotted 
curve), the gas kinetic temperature Tx (dashed curve), and the spin temperature 
T$ (solid curve). Middle panel: Evolution of the gas fraction in ionized regions 
Xi (solid curve) and the ionized fraction outside these regions (due to diffuse X- 
rays) x e (dotted curve). Bottom panel: Evolution of mean 21 cm brightness 
temperature Tj>. In each panel we plot curves for model A (thin curves), model B 
(medium curves), and model C (thick curves). 1781 

Figure [4] shows several examples of the global 21 cm signal and the associated 
evolution in the neutral fraction and gas temperatures. While the details of the 
models may vary considerably, all show similar basic properties. At high rcdshift, 
10 z < 200, the gas temperature cools adiabatically faster than the CMB (since the 
residual fraction of free electrons is insufficient to couple the two temperatures). At 
the same time, collisional coupling is effective at coupling spin and gas temperatures 
leading to the absorption trough seen at the right of the lower panel. The details of 
this trough are fixed by cosmology and therefore may be predicted relatively robustly. 
The minima of this trough corresponds to the point at which collisional coupling starts 
to become relatively ineffective. 

Once star formation begins, the spin and gas temperatures again become tightly 
coupled leading to a second, potentially deeper, absorption trough. The minimum of 
this trough corresponds to the point when X-ray heating switches on heating the gas 
above the CMB temperature leading to an emission signal. The signal then reaches 
the curve for a saturated signal (Ts 3> Tcmb) briefly before the ionization of neutral 
hydrogen diminishes it. 

The ordering of these events is determined primarily by the energetics of the 
processes involved and by the basic properties of the reasonable source spectra. For 
example, ionization requires at least one ionizing photon with energy E > 13.6 eV per 
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baryon while depositing only ~ 10% of that energy per baryon would heat the gas 
to Tk > 10 4 K. However, the details of the shape of the curve after star formation 
begins are highly uncertain - in our model we have neglected any possible redshift 
evolution in the various photon emissivity parameters - but the basic structure of one 
emission feature and two absorption troughs are likely to be robust. By determining 
the positions of the various turning points in the signal one could hope to constrain 
the underlying astrophysics and learn about the first stars and galaxies. 

3. 7. Exotic heating 

One of the key points to take away from this discussion is that the 21 cm global signal 
plays the role of a very sensitive calorimeter of the IGM gas temperature. Provided 
that the coupling is saturated and that the IGM is close to neutral there is a direct 
connection between the 21 cm brightness temperature and the IGM temperature. 
Many models of physics beyond the standard model make concrete predictions for 
exotic heating of the IGM. For example, dark matter annihilation in the early Universe 
can act as a source of X-rays leading to heating. In this subsection, we consider some 
of the possibilities that have been advanced for exotic heating mechanisms and discuss 
the possibility of constraining them. 

Perhaps the most commonly considered source of heating in the dark ages is that 
of dark matter annihilation [TjU US IHH [EH IH3] ■ Dark matter is widely assumed to 
explain the observed galaxy rotation curves as well as the detailed features of the 
CMB acoustic peaks. Simple models of dark matter production and freeze out in the 
early Universe lead to a prediction for the annihilation cross-section required to leave a 
freeze out abundance corresponding to the measured value of the dark matter density 
parameter, J^dm- 

Depending on the dark matter mass, which for fixed I^dm determines the dark 
matter number density Hdm, annihilation of dark matter in the later Universe may be 
an important source of heating. It is important to note that there are two regimes in 
which dark matter annihilation may be important. Since the annihilation rate scales 
as n^y M the rate may be large at early times where tidm has yet to be diluted by the 
cosmic expansion. Alternatively, dark matter annihilation can become important once 
significant numbers of collapsed dark matter halos form, leadin to a local enhancement 
in the dark matter density [84] . 

Alternatives to dark matter annihilation include dark matter decaying into 
standard model particles or photons leading to the deposition of energy in the IGM 
[SOI [HI]- The different density dependence of dark matter decay and annihilation might 
lead to distinguishable redshift evolution of the heating. Further, models where dark 
matter contains an internal excited state that relaxes to the ground state releasing 
energy have been proposed [85] . 

Many other scenarios for exotic heating of the IGM have been put forward, 
emphasising the interest in a new technique for distinguishing models of new 
physics. Primordial black holes produced in the early Universe may evaporate after 
recombination if their masses lie in the range of 10 14 — 10 17 g [EH] and the Hawking 
radiation given off could provide a strong heating source [87] . Moving cosmic strings 
produce wakes that stir the IGM imparting heat into the gas. These were originally 
put forward as a source of density fluctuations for seeding the growth of structure. 
While ruled out for this purpose, cosmic strings might be further constrained via their 
heating effect on the IGM [EE]. 
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Incorporating the heating effect arising from exotic sources requires a knowledge of 
the energy spectrum of photons produced by the source, which must then be carefully 
processed to determine how much of the radiative energy is ultimately deposited into 
the IGM. The cross-section for photon absorption has a number of minima, which 
reflect windows at which the IGM is transparent to photons so that rather than 
being absorbed they may propagate to the present as a diffuse background. For 
most scenarios, this consideration greatly constrains the amount of energy deposited 
as heat in the IGM. Figure [5] illustrates the various processes that dominate the loss 
of energy from an energetic photon at z = 300. 
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Figure 5. Optical depths per time for various photon-IGM processes, in units of 
the Hubble time, at z = 300, assuming a neutral IGM. These include processes 
which deposit energy directly into the IGM (pair production and photoionization), 
processes which redistribute photons (27 — > 2-y) and ones that do both (Compton). 
At very low energies, photoionization is the dominant process; at very high 
energies, e± pair production dominates |86| . 



3.8. Detectability of the global signal with small numbers of dipoles 

The global 21 cm signal could potentially be measured by absolute temperature 
measurements as a function of frequency, averaged over the sky. Since the global signal 
is constant over different large patches of the sky, experimental efforts to measure it 
do not need high angular resolution and can be carried out with just a single dipole. 
The attempted measurement is complicated however by the need to remove galactic 
foregrounds, which are much larger than the desired signal. Foreground removal is 
predicated on the assumption of spectral smoothness of the foregrounds in contrast 
to the frequency structure of the signal. This should allow removal of the foregrounds 
by, for example, fitting a low order polynomial to the foregrounds leaving the 21 
cm signal in the residuals. This methodology requires very precise calibration of the 
instrumental frequency response, which could otherwise become confused with the 
foregrounds. 
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The first experimental efforts to detect the 2f cm global signal have been carried 
out by the COsmological Reionization Experiment (CORE) [H^ and the Experiment 
to Detect the Reionization Step (EDGES) [90]. These have been analysed using a 
tanh model of reionization that depends upon the redshift of reionization z r and 
its duration Az. EDGES is presently able to rule out the most rapid models of 
reionization that occur over a redshift interval as short as Az < 0.06 [91]. These 
first experimental efforts should be seen as the first steps along a road that may lead 
to considerably better constraints. Other experiments using different experimental 
approaches are underway. Some of these use individual dipoles, such as the 
Shaped Antenna measurement of the RAdio Spectrum (SARAS) (R. Subrahmanyan, 
private communications) and the Broadband Instrument for the Global HydrOgen 
ReionizatioN Signal (BIGHORNS) (S. Tingay, private communications), while others 
are exploring ways of using many dipoles as with the Large-aperture Experiment to 
Detect the Dark Ages (LEDA) (L. Greenhill, private communications). 

Theoretical estimates for the ability of a single dipole experiment to constrain 
models of the 21 cm signal can be made via the Fisher matrix formalism (52] • For a 
single dipole experiment, the Fisher matrix may be written as [9 3) 

Fij = "ff (2 + Bt- mt) diogT sky (,»)di og r sky K) ; (3g) 

n=l dPl ^ 

where tint is the total integration time (before systematics limit the performance), 
and we divide the total bandwidth B into -/V cna nnei frequency bins {v n } running 
between [i^nin, ^J. For the 21 cm global signature, our observable is the antennae 
temperature T s k y {v) = T[ g (v) + Tb(i/), where we assume the dipole sees the full sky so 
that spatial variations can be ignored. Best case errors on the parameters {pi}, which 

include both foreground and signal model parameters, are then given by < J F^ 1 . 

Such estimates show that global 21 cm experiments should be able to constrain 
realistic reionization models with Az < 2 [93, 94]. The results of integrating for 500 
hours between 100-200 MHz with a single dipole are shown in Figure [6j where the 
reionization history has been parametrized with a tanh function. 

In addition to constraining reionization, global 21 cm experiments might be used 
to probe the thermal evolution of the IGM at redshifts z > 12. Such high redshifts are 
very difficult to probe via the 21 cm fluctuations (discussed later) since they require 
very large collecting areas. Global experiments bypass this requirement, but still 
suffer from the larger foregrounds at lower frequencies. By going to high redshifts 
such experiments could place constraints on X-ray heating and Lya coupling giving 
information about when the first black holes and galaxies form, respectively. The 
absorption feature resulting from this physics can potentially be larger (~ 100 mK) 
making it a good target for observations. Figure [7] shows how the global 21 cm signal 
can vary with different values of fx and f a . Measuring the global signal would offer 
a useful avenue for distinguishing these models although there is some degeneracy 
between the two parameters. 

EDGES type experiments at frequencies v < 100 MHz are underway from the 
ground and a lunar orbiting dipole experiment - the Dark Ages Radio Experiment 
(DARE) 95J - has also been proposed. Lunar orbit offers a number of advantages 
including being shielded from terrestrial radio frequency interference (RFI) while on 
the far side of the moon and the ability to use the moon to chop the beam aiding 
instrumental calibration [SB]. DARE would be targeted at the 40-120 MHz range 
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Figure 6. 95% constraint region on a "tank" rcionization model T;,(z) = 
T21 tanh[(z — z r )/Az] of the end of reionization for an EDGES-like experiment 
assuming N po i y = 3 (solid curve), 6 (dashed curve), 9 (dotted curve), and 12 (dot- 
dashed curve). Also plotted are the 68 and 95% contours for the WMAP5 electron 
scattering optical depth constraint combined with a prior that Xi(z = 6.5) > 0.95 
(green and red coloured regions). 1931 
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Figure 7. Dependence of 21 cm signal on the X-ray (top panel) and Lya (bottom 
panel) emissivity. In each case, we consider examples with the emissivity reduced 
or increased by a factor of up to 100. Note that in our model fx and fa are 
really the product of the emissivity and the star formation efficiency. 
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ideal for measuring the deep absorption feature and determining fx and /, 



4. 21 cm tomography 

The 21 cm global signal can be viewed as a zeroth order approximation to the full 
21 cm signal, as it is averaged over large angular scales. The full 3D signal will be 
highly inhomogeneous as a result of the spatial variation in the different radiation 
fields and properties of the IGM. In this section, we consider the physics underlying 
21 cm brightness fluctuations in 3D and detail the existing techniques for calculating 
the statistical properties of the signal. 

Ultimately, one might wish to make use of fully numerical simulations of the 
relevant physics and so produce detailed maps of the 21 cm signal along thelight cone. 
At present, the large dynamic range required and the computational cost make this 
a dream for the future. For the moment, it is important to make use of a variety 
of analytic, semi-numerical, and numerical techniques to calculate the expected 21 
cm signal. These different methods complement one another in speed, accuracy, and 
detail as we describe below. 

Fluctuations in the 21 cm signal may be expanded to linear order [3] 

$T b = AA + /3 X S X + /3 a 6 a + /3 t St - Sdv, (36) 

where each 5i describes the fractional variation in the quantity i and we include 
fluctuations in the baryon density (b), neutral fraction (x), Lya coupling coefficient 
(a), gas temperature (T), and line-of-sight peculiar velocity gradient (dv). The 
expansion coefficients are given by 

Pb=1+ x fl + s V (37) 



A* = i 

Pa 



Ztot(l + Xtot) ' 



Ztot(l + Ztot)' 



Pt = ?1 + 1 f eif dlogKff + xIIH dl OgK f ff 



T K -T- t x tot {l + x tot ) \ c dlogT K c dlogTx 

In this expression, we have treated all the terms as being of a similar size, but it 
is important to realise that fluctuations in xh can be of order unity. This means 
that terms in higher order of 8 Xl which one might naively think to be small, can still 
contribute at a significant level to the power spectrum. 

In general, homogeneity and isotropy of the Universe suggest that the power 
spectrum of brightness temperature fluctuations should be spherically symmetric in 
Fourier space i.e. it should only depend on k — |k| for a wavevector k of a given 
Fourier mode. However, redshift space distortions induced by peculiar velocities break 
this symmetry since the direction to the observer becomes important and so only a 
cylindrical symmetry is preserved. This symmetry may be useful in separating signal 
from foregrounds, which typically do not share this symmetry (see e.g. Ref [97J). In 
Fourier space and at linear order, we may write the peculiar velocity term Sg v = —fi 2 8 
[55] , where /i is cosine of the angle between the line of sight and the wavevector k of 



the Fourier mode. With this, we may use (36) to form the power spectrum 
P Tb (k, m) = Pbb + P xx + P aa + Ptt + 2P bx 



CONTENTS 



28 



+ 2P ba + 2P bT + 2P xa + 2P xT + 2P aT 
+ PxSxS + other quartic terms 

+ 2H 2 (PbS + PxS + PaS + PTS) 
+ ^PS8 

~t~ 2P x fi$g vX -\- PxSovSotjX 

+ other quartic terms with 8g v . (38) 

Here we note that all quartic terms must be quadratic in xh and separate them 
depending upon whether they contain powers of 5g v or not. Those that contain powers 
of Sg v will not be isotropic and will lead to the angular dependence of Pr b (see Ref. 
[9"9"] for further discussion). 

We may rewrite Eq. ( 38 ) in more compact form 

P Tb (k^) = i>(fc) + fi 2 P^(k) + /Af>(fc) +P fM {k,ii), (39) 

where we have grouped those quartic terms with anomalous /i dependence into 
the term Pf^ ^(k,/i). In principle, high precision measurements of the 3D power 
spectrum will allow the separation of PT b (k, /i) into these four terms by their angular 
dependence on powers of p? jlOOj . The contribution of the Pf( fe)(tt )(fe, fx) term, with its 
more complicated angular dependence, threatens this decomposition [S3]. Since this 
term is only important during the final stages of reionization, we will not discuss it in 
detail in this paper noting only that the angular decomposition by powers of fi 2 may 
not be possible when ionization fluctuations are important. 

It is unclear whether the first generation of 21 cm experiments will be able to 
achieve the high signal-to-noise required for this separation [S3]. Instead, they might 
measure the angle averaged quantity 

P Tb (k) =P fl o(k) + P fl 2(k)/3 + P^(k)/5, (40) 

(where we neglect the Pf(k,^,) {k, /J,) term). One typically plots the power per 
logarithmic interval A = [fc 3 P(fc)/27r 2 ] 1 / 2 . 



4-1- Redshift space distortions 

Peculiar velocity effects can have a significant effect on the 21 cm signal. At linear 
order, the effects of peculiar velocities are well understood [SSI HOOl UOlj since the 
8q v term is simply related to the total density field. However, as the density field 
evolves and non-linear corrections to the velocity field become important the picture 
can change in ways that are not yet well understood. 

Redshift space effects become important because our observations are made in 
frequency space, while theory makes predictions most directly in coordinate space. 
The conversion between the two is affected by the local bulk velocity of the gas. We 
may write the comoving distance to an object as 

Xz ^l (l+zwr (41) 

where we introduce % as the comoving Hubble parameter. Using TL makes the notion 
compact and comes from introducing the conformal time coordinate r\ related to the 
proper time t via drj = dt/a{t), where a(t) is the usual scale factor, so that the 
comoving Hubble parameter TL — (l/a)da(ri)/dr]. 



CONTENTS 



29 



Including the effects of peculiar velocity, the true coordinate distance to an object 
with measured redshift z is 

X = X*-v(x).n/W|,. (42) 

Writing our coordinates as x = xn in real space and s = x z n in redshift space gives 
the mapping between the two as 

s = x+[v(x)-n/^]n (43) 

Accounting for this difference in the coordinate systems leads to the redshift space 
distortions. In linear theory, we have 

V-v(x) = -7tf(x), (44) 

where we are assuming the Universe to be matter dominated so that the growth 
factor / = d log D + /d log a = 1. The Fourier transform of this result introduces 
the factor of /i 2 . More generally, large peculiar velocities can lead to the so-called 
"finger of god" effects from virialised structures and greatly complicate efforts to 
separate components via the angular structure of the power spectrum |102|, 11031 1104J . 



Mistakenly attempting to use the form ( 40 1 would lead to significantly biased results 



[103 , and so new estimators calibrated by simulations are needed. 



4-2. Ionization fluctuations 

Reionization is a complicated process involving the balancing of ionizing photons 
originating in highly clustered collections of galaxies and recombinations in dense 
clumps of matter. It is perhaps surprising then that one can produce remarkably 
robust models of the topology of reionization by simply counting the number of ionizing 
photons. This basic insight lies at the center of the analytic calculation of ionization 
fluctuations [3"fJI 1105] . 

Imagine a spherical region of gas containing a total mass of ionized gas m- lon . 
As an ansatz for determining whether this region of gas will be ionized, we can ask 
whether the region contains a quantity of galaxies sufficient to ionize it. Connecting 
to the language we set up earlier when discussing the 21 cm global signal, we ask 
whether the following condition is satisfied 

"lion > C™gal, (45) 

where £ is the ionizing efficiency and m ga i is the total mass in galaxies. Note this is 
essentially the condition that the galaxies produce enough ionizing photons to have 
ionized all the gas within the region. 

This condition equates to asking if the collapse fraction exceeds a critical value 
/coil > fx = C • From the definition of the collapse fraction (making use of the 
Press-Schechter |106j mass fraction for analytic simplicity), these can be translated 
into a condition on the mass overdensity if a region is to self ionize 

S m > S x (m, z) = S c (z) - V2K(C) [<r 2 min - a 2 (m)] 1/2 , (46) 

where K(() = erf _1 (l— C -1 )- This condition to self-ionization can be used to calculate 
the probability distribution of ionized regions or bubble sizes rihubim) by reference to 
the excursion set formalism |107j . Smoothing a Gaussian density field on decreasing 
mass scales corresponds to a random walk in overdensity. Once the overdensity for 
a region crosses the ionization threshold, the mass enclosed will be ionized. This is 
similar to the mass function calculations of Press-Schcchter or Sheth-Tormen, except 
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with a mass dependent rather than constant barrier. The distribution of bubbles sizes 
found from this analytic calculation has been shown to be a good match to numerical 
reionization simulations at the same neutral fraction [1081 1109j . 

To connect the bubble distribution (a one-point statistic) to the power spectrum 
(a two-point statistic) requires extra thought. It is possible to generalise the basic 
excursion set formalism to keep track of two correlated random walks corresponding 
to two spatially separated locations. This very directly gives the two-point correlation 
function for the ionization (or density) field [1101 1111] . Unfortunately the resulting 
expressions are somewhat complicated to deal with and simpler more approximate 
calculations can be more useful. We follow Ref. [35], which incorporates some simple 
ansatzes for the form of P xx and P x $ based upon the expected clustering properties 
of the bubbles. 



4-3. Fluctuations in the coupling 

Next we consider fluctuations in the Lya coupling [33J 125 ■ Provided that we neglect 
the mild temperature dependence of S a |31j , the fluctuations in the coupling are simply 
sourced by fluctuations in the flux and we may write S a — 8j a . 

Density perturbations at redshift z' source fluctuations in J a seen by a gas element 
at redshift z via three effects. First, the number of galaxies traces, but is biased with 
respect to, the underlying density field. As a result an overdense region will contain a 
factor [1 + b(z')5] more sources, where b(z') is the (mass-averaged) bias, and will emit 
more strongly. Next, photon trajectories near an overdense region are modified by 
gravitational lensing, increasing the effective area by a factor (1 + 25/3). Finally, 
peculiar velocities associated with gas flowing into overdense regions establish an 
anisotropic redshift distortion, which modifies the width of the region contributing to a 
given observed frequency. Given these three effects, we can write 5 a = Sj a = W a (k)S, 
where we compute the window function W a ,*{k) for a gas element at z by adding the 
coupling due to Lya flux from each of the Lyn resonances and integrating over radial 
shells [33], 



1 'Wx pz mB , x (n) , r(n) 



7 " • n-1 



D{ ' ] h^ + b^)Mkr)-h 2 (kr)\, (47. 



D(z) K " Jvy ' 3 J 

where D{z) is the linear growth function, r = r(z, z') is the distance to the source, and 
the ji(x) are spherical Bessel functions of order I. The first term in brackets accounts 
for galaxy bias while the second describes velocity effects. The ratio D(z')/D(z) 
accounts for the growth of perturbations between z' and z. Each resonance con tributes 
a differential comoving Lya flux dJa/dz', calculated from equation (33). This 



analytic prescription has been bound to match later simulations fairly well j!12j . At 
present, simulations generally do not account for the full Lya radiative transfer so this 
agreement is not unexpected and comparisons to future, more detailed simulations will 
be needed. 

On large scales, W a ir (k) approaches the average bias of sources, while on small 
scales it dies away rapidly encoding the property that the Lya flux becomes more 
uniform. In addition to the fluctuations in J a there will be fluctuations in J a x- We 



CONTENTS 



31 



discuss these below, but note in passing that the effective value of W a is the weighted 
average W a = J^i W a j(J a j/J a ) of the contribution from stars and X-rays. 

This description of the coupling fluctuations neglects one important piece of 
physics. Namely, it assumes that UV photons redshift until they reach the line center 
of a Lyman series resonance and only then they scatter. At redshifts before reionization 
the Universe is filled with neutral hydrogen leading to an optical depth for scattering 
of Lya photons that is very large r a > 10 5 . While this is reduced for higher series 
transitions, the Universe is optically thick for all but the highest n transitions. This 
has the consequence that photons will preferentially scatter in the wings of the line 
and only a few will make it to line center. 

The related implications have been investigated in the literature (1131 11141 1115] . 
Since UV photons scatter in the wings of the line they will travel a significantly reduced 
distance from the source before scattering. This reduces the size of the coupled region 
around a source and acts to steepen the flux profile surrounding a source. These effects 
increase the variance in the Lya flux on small scales. This can be incorporated into our 
analytic formalism as a modification to the dJa /dz' term in (47 1 to account for the 
modified flux profile around individual sources. For our purposes, we will neglect this 
since it adds considerable numerical complication without modifying the qualitative 
features of the power spectrum. 

Once the effect of the optical depth is taken into account, another limitation of this 
approach becomes apparent. We have assumed that the photons propagate through a 
uniform IGM with mean properties. Density inhomogeneities and especially velocity 
flows may modify the propagation and scattering of UV photons leading to extra 
fluctuations that have not been accounted for here. 

This becomes especially apparent when one recalls that the sources are placed 
in ionized bubbles. This means that on small scales there are regions where there is 
no coupling because there is no neutral hydrogen. A simple way to account for this 
is by changing the lower limit of (47) to zhii, the typical size of an ionized region 
|115j . Clearly, if the patch of gas being considered is closer to a source than the HII 
region that surrounds that source then the patch will be ionized and there will be no 
signal. This leads to a reduction in power on small scales. For the sharp cutoff of 
[115] one numerically sees oscillation in the power spectrum, but in reality averaging 
over a distribution of ionized bubble sizes would remove these oscillations yielding a 
smooth reduction in power. 



4-4- Formalism for temperature and ionization fluctuations from X-rays 

We next turn to fluctuations in the gas temperature and the free electron fraction 
in the IGM. Following on from our discussion of the global history, we will assume 
that the evolution of Tk and x e is driven by the effect of X-rays and will consider the 
fluctuations St, and S e that arise from clustering of the X-ray sources. In doing so, 
we will follow the approach of Ref . [S3] . 

Before plunging into the calculation, it is worth detailing some of the issues that 
face radiative transfer of X-ray photons. The cross-section for X-ray photoionization 
depends sensitively on photon energy E, a ~ E~ 3 , so that we must keep track of 
separate energies. Most importantly, this energy dependence means that the IGM 
is optically thick for soft X-rays (E ~ 20eV), but optically thin for hard X-rays 
(E > IkeV). Moreover, heating is a continuous process and the temperature of a gas 
element depends on its past history. This is different from UV coupling where only 
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the Lya flux at a given redshift is important. We must therefore be careful to track 
the change in fluctuations in the heating and integrate these to get the temperature 
fluctuations at a later time. Insight can be gained by looking at the results of ID 
numerical simulations of X-ray radiative transfer 52 . 54J . 

The prescription we adopt here describes X-ray fluctuations produced by the 
clustering of X-ray sources. We neglect the possibility of Poisson fluctuations in the 
distribution of X-ray sources, since it is not clear how to calculate this. Poisson 
fluctuations in the Lya flux were considered in [33] . but the formalism is not readily 
applicable to heating. As a result, this formalism is most appropriate in scenarios 
where X-ray sources are relatively common and would not be appropriate for describing 



heating by extremely rare quasars [116J 

We begin by forming equations for the evolution of St and S e (the fractional 



fluctuation in x e ) by perturbing equations (15) and (17) (see also [1171 [50] ). This 
gives 

dS T 2dS sr 2A hcat ,i 

r\A fl _ J- \ _ 

oo^ = \_^Xel Ke[5 ^ ^ aAC ^ nH[6e + 6]> (49) 
dt x e 

where an overbar denotes the mean value of that quantity, and A = e/n is the 
ionization or heating rate per baryon. We note that the fluctuation in the neutral 
fraction 5 X is simply related to the fluctuation in the free electron fraction by 
8 X = -x e /(l ~ x e )S e . 

Our challenge then is to fill in the right hand side of these equations by calculating 
the fluctuations in the heating and ionizing rates. We will focus here on Compton and 



X-ray heating processes. Perturbing equation (24) we find that the contribution of 



,(50) 



Compton scattering to the right hand side of equation ( 48 ) becomes [50] 

2Aheat.C rx x i %e a~ 4 

J-K J Ik 

where 8^ is the fractional fluctuation in the CMB temperature, and we have ignored 
the effect of ionization variations in the neutral fraction outside of the ionized bubbles, 
which are small. Before recombination, tight coupling sets Tk = T 7 and St = <5t t ■ 
This coupling leaves a scale dependent imprint in the temperature fluctuations, which 
slowly decreases in time. We will ignore this effect, as it is small (~ 10%) below z = 20 
and once X-ray heating becomes effective any memory of these early temperature 
fluctuations is erased. At low z, the amplitude of fluctuations in the CMB St becomes 



negligible, and equation (50) simplifies. Our main challenge then is to calculate the 
fluctuations in the X-ray heating. We shall achieve this by paralleling the approach we 
took to calculating fluctuations in the Lya flux from a population of stellar sources. 

First, note that for X-rays <$A ion = ^A hoat = ^A Q = ^A x > as the rate of heating, 
ionization, and production of Lya photons differ only by constant multiplicative 
factors (provided that we may neglect fluctuations in x e , which are small, and focus 
on X-ray energies E > lOOeV). In each case, fluctuations arise from variation in the 
X-ray flux. We then write Sa x — Wx{k)S and obtain, 
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Ax J Eth Jz 



dz' 



D[ ' J) f " 1 b{z')]j Q {kr) -lh{kr)\ , (.11.) 



D(z) I v /JJUV ' 3" 

where the contribution to the energy deposition rate by X-rays of energy E emitted 
with energy E' from between redshifts z' and z' + dz' is given by 

dAx(g) 4vr .^dJx^z) 

where u v (E) is the cross-section for photo-ionization, E^h is the minimum energy 
threshold required for photo-ionization, and Ax is obtained by performing the energy 
and redshift integrals. Note that rather than having a sum over discrete levels, as 
in the Lya case, we must integrate over the X-ray energies. The differential X-ray 
number flux is found from equation |28| ). 

The window function Wx(k) gives us a "mask" to relate fluctuations to the 
density field; its scale dependence means that it is more than a simple bias. The 
typical sphere of influence of the sources extends out to several Mpc. On scales 
smaller than this, the shape of Wj(fc) will be determined by the details of the X-ray 
source spectrum and the heating cross-section. On larger scales, the details of the 
heated regions remain unresolved so that Wx(k) will trace the density fluctuations. 

An X-ray is emitted with energy E' at a redshift z' and redshifts to an energy 
E at redshift z, where it is absorbed. To calculate Wx we perform two integrals in 
order to capture the contribution of all X-rays produced by sources at redshifts z' > z. 
The integral over z' counts X-rays emitted at all redshifts z' > z which redshift to 
an energy E at z\ the integral over E then accounts for all the X-rays of different 
energies arriving at the gas element. Together, these integrals account for the full 
X-ray emission history and source distribution. Many of these X-rays have travelled 
considerable distances before being absorbed. The effect of the intervening gas is 
accounted for by the optical depth term in Jx- Soft X-rays have a short mean free 
path and are absorbed close to the source; hard X-rays will travel further, redshifting 
as they go, before being absorbed. Correctly accounting for this redshifting when 
calculating the optical depth is crucial as the absorption cross-section shows strong 
frequency dependence. In our model, heating is dominated by soft X-rays, from nearby 
sources, although the contribution of harder X-rays from more distant sources can not 
be neglected. 

Returning now to the calculation of temperature fluctuations, to obtain solutions 



for equations (48) and (49), we let St = gr{k,z)S, S e = g e (k,z)S, S a = W a (k, z)5, and 
Sa x = W x (k,z)5 [HI]. Since we do not assume these quantities to be independent 
of scale we must solve the resulting equations for each value of k. Note that we do 
not include the scale dependence induced by coupling to the CMB [50 . In the matter 
dominated limit, we have S oc (1 + z)~ l and so obtain 



&9t ( 9t - 2/3 



Qx(z)[W x (k) - gr] - Qc(z)g T , (53) 



dz \ 1 + z 

- Qi(z)[W x (k) - g e ] + Q R (z)[l + g e ], (54) 



dffe = ( ge 

dz \l + z 
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where we define 



J(Z)= x e (l + s)g(z) ' (55) 

Qr{z) = T^f^ (56) 
(l + 4ff(z) 

Qo{z)= ?' ( . 1 ^' ^ L , (57) 

1 + /He + t 7 -ff (2) Tr: 



and 



9 A 

6k B T K (l + z)H(z) 

These are defined so that Qr and Qi give the fractional change in x e per Hubble 
time as a result of recombination and ionization respectively. Similarly, Qc and Qx 
give the fractional change in Tk per Hubble time as a result of Compton and X-ray 
heating. Immediately after recombination Qc is large, but it becomes negligible once 
Compton heating becomes ineffective at z ~ 150. The Qr term becomes important 
only towards the end of reionization, when recombinations in clumpy regions slows the 
expansion of HII regions. Only the Qx and Qi terms are relevant immediately after 
sources switch on. We must integrate these equations to calculate the temperature 
and ionization fluctuations at a given redshift and for a given value of k. 

These equations illuminate the effect of heating. First, consider gx, which is 
related to the adiabatic index of the gas j a by gx = 7 a — 1, giving it a simple 
physical interpretation. Adiabatic expansion and cooling tends to drive gx — > 2/3 
(corresponding to j a — 5/3, appropriate for a monoatomic ideal gas), but when 
Compton heating is effective at high z, it deposits an equal amount of heat per 
particle, driving the gas towards isothermality (gx — > 0). At low z, when X-ray 
heating of the gas becomes significant, the temperature fluctuations are dominated 
by spatial variation in the heating rate (gx — > Wx)- This embodies the higher 
temperatures closer to clustered sources of X-ray emission. If the heating rate is 
uniform Wx(k) ~ 0, then the spatially constant input of energy drives the gas towards 
isothermality gx ~> 0. 

The behaviour of g e is similarly straightforward to interpret. At high redshift, 
when the IGM is dense and largely neutral, the ionization fraction is dominated by the 
recombination rate, driving g x — > — 1, because denser regions recombine more quickly. 
As the density decreases and recombination becomes ineffective, the first term of 



equation (54) gradually drives g x — > 0. Again, once ionization becomes important, 
the ionization fraction is driven towards tracking spatial variation in the ionization 
rate (g x — > Wx)- Note that, because the ionization fraction in the bulk remains less 
than a few percent, fluctuations in the neutral fraction remain negligibly small at all 
times. 

Fluctuations in the temperature gx attempt to track the heating fluctuations 
Wx(k), but two factors prevent this. First, until heating is significant, the effect of 
adiabatic expansion tends to smooth out variations in gx- Second, gx responds to the 
integrated history of the heating fluctuations, so that it tends to lag Wx somewhat. 
When the bulk of star formation has occurred recently, as when the star formation 
rate is increasing with time, then there is little lag between gx and Wx- In contrast, 
when the star formation rate has reached a plateau or is decreasing the bulk of the 



CONTENTS 



35 



X-ray flux originates from noticeably higher z and so gx tends to track the value of 
W\ at this higher redshift. On small scales, the heating fluctuations are negligible 
and qt returns to the value of the (scale independent) uniform heating case. 

4-5. Evolution of the full power spectrum 

Having described the different components of the 21 cm power spectrum, we now 
need to put them together. The 21 cm power spectrum is a 3D quantity observed as 
a function of scale and redshift, much like a movie evolving with time on a 2D screen. 
Displaying this information on a static 2D paper is therefore challenging. 

Figure [8] shows the evolution of the power spectrum as a function of redshift 
for several fixed fc-values. Four key epochs can be picked out. At early times 
z > 30 before star formation, the power spectrum rises towards a peak at z ~ 50 
and falls thereafter as the 21 cm power spectrum tracks the density field modulated 
by the mean brightness temperature. Once stars switch on, there is a period of 
complicated evolution as coupling and temperature fluctuations become important. 
Next, ionization fluctuations become important culminating in the loss of signal at 
the end of reionization. Thereafter, a weaker signal arises from the remaining neutral 
hydrogen in dense clumps that grows as structures continue to collapse. 



i/[MHz] 500 100 50 10 5 




1+z 

Figure 8. Evolution of power spectrum fluctuations. The different curves show 
P(k,z) as a function of z at fixed k for k = 0.01, 0.1, 1, 10 Mpc - 1 . Diagonal 
lines show eT{ s (u), the foreground temperature reduced by a factor e ranging from 
10~ 3 — 10 — 9 to indicate the level of foreground removal required to detect the 
signal I78| . 

Diagonal lines in Figure [8] show the amplitude of the mean foreground reduced by 
a factor ranging from 10~ 3 — 10~ 9 (note that it is really fluctuations in the foregrounds 
that set the upper limit, but this is less well known than the mean level). This gives 
a measure of the difficulty of removing the foregrounds and the sensitivity of the 



CONTENTS 



36 



instrument required for a detection. This diagonal lines serve to identify the most 
readily accessible epochs. The 10 -5 line identifies the level required to: (i) access 
density fluctuations at z < 4; (ii) access ionization fluctuations atz«6 - 12; and (Hi) 
detect fluctuations from heating and coupling at z sa 20. It is perhaps surprising that 
a similar sensitivity is needed for all three of these regimes, since the foregrounds grow 
monotonically with decreasing frequency. This is explained by the strong evolution 
in the amplitude of the signal. The fourth epoch at z ss 30 — 50 of the dark ages is 
considerably more difficult to detect, requiring the 10~ 7 level of foreground removal. 

4-6. Other sources of fluctuations 

We have focussed on fluctuations in the 21 cm signal that arise from spatial variation 
in three main radiation fields - UV, Lya , and X-rays. Other sources of fluctuations 
from the non-linear growth of structure are possible. 

Since the 21 cm signal arises from neutral hydrogen it is interesting to examine 
the densest regions of neutral hydrogen. These occur in those dense clumps that have 
achieved the critical density for collapse, but that lack the mass for efficient cooling 
of the gas that would lead to star formation. This requirement is satisfied for halos of 
with virial temperature below 10 4 K provided that only atomic hydrogen is available 
for cooling or for halos with T vir < 300K if molecular hydrogen is present. These 
minihalos should be abundant in the early Universe, although at later times external 
ionizing radiation may cause them to evaporate. Furthermore, their formation may 
be prevented with moderate gas heating that raises the Jean's mass suppressing the 
collapse of these low mass objects. The high density in these regions implies that 
collisions can provide the main coupling mechanism and due to the high temperature 
of the virialised gas these minihalos are bright in 21 cm emission |1181 11191 11201 [T2Tj . 

When baryons and photons decouple at z « 1, 100 there is a sudden drop in 
the pressure supporting the baryons against gravitational collapse and they begin to 
fall into dark matter overdensitics. It was recently realised that the relative velocity 
between the baryons and dark matter exceeds the local sound speed (which drops 
dramatically from the relativistic value of c/y/3 to ~ \JksT /m p ) leading to supersonic 
flows |122j . These flows have the potential to suppress the formation of the first 
gas clouds by preventing the baryons from collapsing into dark matter halos with 
low escape velocities |123j . This suppression of galaxy formation in minihalos may 
have important consequences for the 21 cm signal during its earliest phases - for 
example, delaying the onset of Lya coupling. It has even been suggested that the 
increased modulation of the earliest galaxies might boost the Lya coupling fluctuations 
significantly |124j . Ultimately, it appears that as the higher mass halos required for 
atomic cooling begin to collapse, the memory of this effect is greatly reduced 125, 126 . 
It is possible that by suppressing the building blocks of larger galaxies mild echos 
of this effect might be detectable in late galaxies in a similar way to the effects of 
inhomogeneous reionization |1271 11281 1129j . 

In the local Universe, shocks are known to be an important mechanism for IGM 
heating. Shocks heat the gas directly converting bulk motion into thermal energy. 
If magnetic fields are present in the shock, strong shocks can also accelerate charged 
particles. This can lead to radiative emission of photons at energies from radio to 
X-ray energies due to inverse Compton scattering of CMB photons from the charged 
particle. The resulting X-rays can further heat the IGM. At high redshift, shocks can 
be broadly divided into two categories: strong shocks around large scale structure and 
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weak shocks in the diffuse IGM due to the low sound speed of the gas. 

Large scale structure shocks occur as gas surrounding an overdense region 
decouples from the Hubble flow and undergoes turnaround. As the gas begins to 
collapse inwards derivations from spherical symmetry will lead to shell crossing and 
shocking of the gas. These shocks have been discussed both analytically -39J and 
observed in simulations [SO] 1121] . The temperature distribution of shocked gas can be 
estimated using a Press-Schecter formalism under the assumption that all gas that has 
undergone turn around has shocked and estimating the shock temperature from the 
characteristic peculiar velocity of the collapsing overdense region. These models show 
that, in the absence of X-ray heating, the thermal energy of the IGM is dominated 
by large scale shocks. Since these shocks trace the collapse of structure, they only 
become significant at redshifts z < 20. These shocks cannot play an important role in 
ionizing the IGM, since only a small fraction of the baryons participate in them and 
they generate < leV per participating baryon at z > 10 [130] . 

Finally, we note that there is one final radiation background that could in principle 
lead to fluctuations in the 21 cm signal: the diffuse radio background. In our 
calculations, we have assumed that the ambient 21 cm radiation field is dominated by 
the CMB, so that T 1 — 2cmb- This is likely to be a good assumption, but one can 
imagine a situation in which the 21 cm flux might be influenced by nearby radio bright 
sources. Since 21 cm photons have a large mean free path a diffuse radio background 
may build up in the same way as the diffuse X-ray background grows. Fluctuations in 
this radio background would arise from clustering of the radio sources. We note that 
the only period where these fluctuations would be important are in the regime where 
Lya and collisional coupling are unimportant, so that the spin temperature relaxes to 
the ambient radiation temperature T 7 . 

4-7. Simulation techniques 

In the previous sections we have focused on analytical descriptions of calculating the 21 
cm signal. These provide a framework for understanding the underlying physics that 
governs the signal. They also provide a method for rapidly exploring the dependence 
of the 21 cm power spectrum on a wide range of astrophysical parameters and 
determining their relative importance. Ultimately, the interpretation of observations 
requires comparison of data to theoretical predictions at a more detailed level. This 
necessitates the production of maps from numerical techniques. In this section we 
describe a hierarchy of different levels of numerical approximations that allow more 
quantitative comparisons. A more detailed review can be found in Ref. |131j . 

Closest to the analytic techniques described in this review are semi-numerical 
techniques. These are primarily techniques for simulating the reionization field based 
upon an extension of the excursion set formalism [107, 39l I108j . It has been realised 
that the spectrum of ionized fluctuations depends primarily on a single parameter - 
the ionized fraction x% [132 . Once Xi is fixed the ionization pattern can be calculated 
by filtering the density field on progressively smaller scales and asking if a region is 
capable of self-ionization. Only those regions capable of self-ionization are taken to be 
ionized; this amounts to photon counting. This technique forms the basis of a number 
of codes, 21cmFAST [133], SIMFAST21 [134], which can rapidly calculate the 21 cm 
signal with reasonable accuracy. To add in fluctuations in the Lya and temperature 
these codes also evaluate the equivalent of the analytic calculations from §4.4| The 
relevant expressions are convolutions of the density field with a kernel describing the 
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astrophysics and so may be rapidly evaluated on a numerical grid via fast Fourier 
transform (FFT) techniques. Other semi-numerical techniques exist such as BEARS 
|135| . which is based upon painting spherically symmetric ionization, heating, or 
coupling profiles from a library of ID radiative transfer calculations appropriately 
scaled for a halos star-formation rate and formation time onto the density field. 

Fully numerical simulations are the ultimate destination for these calculations, 
but as of yet it is difficult to achieve the required simulation volume and dynamic 
range required for the full calculation of the 21 cm signal. Numerical simulations 
can be broadly split into two classes: those that fully account for gas hydrodynamics 
and those that do not. The latter arc essentially dark matter N-body calculations 
with a prescription for painting baryons onto the simulation in a post-processing step. 
Radiative transfer prescriptions are then applied in a further post-processing step to 
calculate the evolution of the ionized regions. Such calculations |1021 [132, 136, 137 
are of great utility in describing the large scale properties of the 21 cm signal. Full 
hydrodynamic calculations [138J are typically restricted to smaller volumes making 
them unrepresentative of the cosmic volume. They have the advantage of self- 
consistently evolving the dark matter, baryons, and radiation field. This allows the 
evolution of photon sinks, in the form of mini-halos and dense clumps, to be studied 
in detail. As computers improve these simulations will grow in size. 

Finally, most of the simulation work to date has focussed on reionization and 
made the assumption that Tg 3> Tcmb ignoring the effect of spin-temperature 
fluctuations. For predicting the 21 cm signal it is important that analytic calculations 
of these T$ variation be verified numerically Work on incorporating Lya and X-ray 
propagation into reionization simulations exists, but is at an early stage of development 
[1141 11391 1140j . These calculations require keeping track of the radiative transfer of 
photons in many frequency bins and so becomes numerically expensive. 

4-8. Detectability of the 21 cm signal 

Our focus in this review is on the physics underlying the 21 cm signal, but it is 
appropriate to pause for a moment and consider the instrumental requirements for 
detecting the signal. We direct the interested reader to the review in Ref. [I], which 
covers the near term prospects for detecting the 21 cm signal from reionization in 
some detail. 

The 21 cm line from the epoch of reionization is redshifted to meter wavelengths 
requiring radio frequency observations. While a typical radio telescope is made of a 
single large dish, an interferometer composed of many dipole anntenae is the preferred 
design for 21 cm observations. Cross-correlating the signals from individual dipoles 
allows a beam to be synthesised on the sky. Using dipoles allows for arrays with very 
large collecting areas and a large field of view suitable for surveys. This comes at the 
cost of large computational demands and, driven by the long wavelengths, relatively 
poor angular resolution (~ 10 arcmin, which corresponds to the predicted bubble size 
at the middle of reionization). 

The sensitivity of these arrays is determined in part by the distribution of the 
elements, with a concentrated core configuration giving the highest sensitivity to the 
power spectrum [141] . The desire for longer baselines to boost the angular resolution in 
order to remove radio point sources places constraints on the compactness achievable 
in practice. 

The variance of a 21 cm power spectrum measurement with a radio array for a 
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single k-mode with line of sight component k\\ — fik is given by |141j : 



<Tp(k,v)= jr T T b P 2 i («, M) + T sys ^7 77TT T~ 

Afield ^ Sti n t n(fcjj \A e 




We restrict our attention to modes in the upper-half plane of the wavevector k, 
and include both sample variance and thermal detector noise assuming Gaussian 
statistics. The first term on the right-hand-side of the above expression provides 
the contribution from sample variance, while the second describes the thermal noise 
of the radio telescope. The thermal noise depends upon the system temperature T sys , 
the survey bandwidth B, the total observing time t- m t, the comoving distance D{z) 
to the center of the survey at redshift z, the depth of the survey AD, the observed 
wavelength A, and the effective collecting area of each antennae tile A e . The effect of 
the configuration of the antennae is encoded in the number density of baselines n±(k) 
that observe a mode with transverse wavenumber k± |142) . Observing a number of 
fields iVfioid further reduces the variance. 

4-9. Statistics beyond the power spectrum 

In our discussion of the 21 cm fluctuations, we have focussed on the power spectrum 
as a statistical quantity that can be measured from maps of the sky. When 
experiments lack the sensitivity to image the 21 cm fluctuations at high signal to 
noise it is important to compress the information into statistical quantities that can 
be accurately measured. If 21 cm fluctuations were Gaussian then the power spectrum 
would contain all information about the signal. However, the 21 cm signal is highly 
non-Gaussian since the presence of ionized bubbles or spheres that have been heated 
imprints definite features in space. It is therefore important to think about non- 
Gaussian statistics that can be used to extract information that is not contained in 
the power spectrum from 21 cm observations. 

The simplest form of non-Gaussianity that arises in the 21 cm signal is the 
primordial non-Gaussianity induced in the density field during inflation. This is often 
characterised by the /nl parameter defined by assuming a quadratic correction to 
the inflaton potential so that schematically <f> = 4>g + /nl<^g> with </> the full inflaton 
potential and 4>q the Gaussian potential. This form of non-Gaussianity shows up 
clearly in the bispectrum, the Fourier transform of the 3pt correlation function [143] . 
Measuring /nl is an important goal of cosmology since it can effectively distinguish 
between different classes of inflationary potential. Unfortunately, /nl is expected to be 
small (~ (1 — n s ) ~ 0.05 for slow roll inflation models) and constraints from the CMB 
and galaxy surveys are relatively weak [144) . One long term hope is that observations 
of the pristine 21 cm signal at z > 30 could lead to very stringent non-Gaussianity 
constraints, since such surveys can probe very large volumes of space |145l 1146] , 

In the near term, studies of 21 cm non-Gaussianity will be most useful for learning 
about astrophysics in more detail. Ionized bubbles during rcionization will imprint 
a particular form of non-Gaussianity on the 21 cm signal that should contain useful 
information about the sizes and topology of ionized regions. The challenge is to 
develop statistics matched to that form of non-Gaussianity, which is currently an 
unsolved problem. Some examples of statistics that have been explored are the 1 pt 
function (or probability distribution function) of the 21 cm brightness field, which 
develops a skewness as reionization leads to many pixels with zero signal [3^1 1147] ; 
the Euler characteristic [148) . which determines the number of holes in a connected 
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surface; and there are many other possibilities including the bispectrum, wavelets, and 
threshold statistics |149] that have yet to be properly explored. In addition to new 
statistics, it is possible for signatures of non-Gaussianity to show up as a modification 
to the shape of the power spectrum [150] . 

4-10. Prospects for cosmology 

In this review, we have mostly focused on the ability of the 21 cm signal to constrain 
different aspects of astrophysics and the physics that needs to be understood in order to 
do so. In the first instance the effects of the astrophysics of galaxy formation needs to 
be understood before fundamental physics can be addressed. Once this is understood 
there is considerable potential for addressing cosmology. 21 cm observations represent 
one of the only ways of accessing the large volumes and linear modes present at 
high redshifts. Studies show that 21 cm experiments have considerable potential to 
improve on measurements of cosmological parameters |99[ 1151] , on models of inflation 
[1521 1153] . and on measurements of the neutrino mass [154J. 

We have touched upon the possibility of constraining exotic heating and structure 
formation in the early Universe via the 21 cm global signal. These experiments are 
primarily sensitive to key transitions such as the onset of Lya coupling, the onset of 
heating, and the reionization of the Universe. These key moments can be affected by 
different heating mechanisms in similar ways and so most likely will provide upper 
limits on the contribution of exotic physics. Such elements as heating by dark matter 
decay or annihilation [80, 81, 82, 83 , evaporating primordial blackholes [86 , or other 
such physics might all be constrained. 

Ideally one would target the cosmic dark ages, where the 21 cm signal depends 
on well-understood linear physics. In this regime, one would have the possibility of 
making precision measurements similar to the CMB, but at many different redshifts. 
This represents the long term goal of 21 cm observations as a tool of cosmology. 
Unfortunately, many technical challenges need to be overcome before this is realistic. 
Foregrounds become much larger relative to the signal at the low frequencies relevant 
for the cosmic dark ages. This requires large experiments to achieve the required 
sensitivity. Since the sensitivity of an interferometer to the 21 cm power spectrum 
typically decreases on smaller angular scales as a result of fewer long baselines, large 
angular scales are easier to access. It was shown in |155j that an experiment with 10 
km 2 collecting area could detect a constrained isocurvature mode via its signature on 
large angular scales in the 21 cm power spectrum at z~ 30. In contrast, detecting 
the running of the primordial power spectrum via the 21 cm power spectrum on scales 
k > IMpc requires experiments with >10 km 2 to achieve the necessary sensitivity 

m- 

These low frequencies will probably require escaping the Earth's ionosphere to 
space or the lunar surface [156) . The payoff would be unprecedented precision on 
cosmological parameters and a precise picture of the earliest stages of structure 
formation. This could lend insight into fundamental physics of phenomena like 
variations in the fine-structure constant [157J and the presence of cosmic superstrings 
[158]. 

More accessible in the near future, is the EoR where the astrophysics of star 
formation must be disentangled to get at cosmology. This will likely involve the use 
of redshift space distortions and astrophysical modeling as a way of removing the 
astrophysics leaving the cosmological signal behind. In the future, with sensitivity 
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sufficient to image the 21 cm fluctuations one can imagine developing sophisticated 
algorithms to exploit the full 3D information in the maps to separate astrophysics, 
much as one currently removes foregrounds in the CMB. This has not been studied in 
great detail, but preliminary work on modelling out the contribution of reionization 
appears promising. 

It has been shown in simulations that just one parameter - the ionized fraction Xi 
- does a good job of describing the ionization power spectrum at the few percent level. 
This arises from well understood physics and amounts to a simple photon counting 
argument. Thus, at first order we might hope to be able to predict the ionization 
contribution to the 21 cm signal during reionization and marginalise over it in the 
same manner as the bias of galaxies in galaxy surveys. At higher levels of precision, 
accounting for the modifications from the clustering of the sources or sinks of ionizing 
photons will become important. Modelling of the ionization power spectrum need only 
degrade cosmological constraints by factors of a few |151j . 

5. Intensity mapping in atomic and molecular lines 

5.1. 21 cm intensity mapping and dark energy 

Reionization eliminated most of the neutral hydrogen in the Universe, but not all. 
In overdense regions the column depth can be sufficient for self-shielding to preserve 
neutral hydrogen. Examples of such pockets of neutral hydrogen are seen as damped 
Lya systems in quasar spectra. Whereas the pre-reionization 21 cm signal provides 
information about the topology of ionized region, the post-reionization signal describes 
the clustering of collapsed halos based on the underlying density field of matter. 

The notion of intensity mapping of 21 cm bright galaxies was introduced in several 
papers [1591 11601 1161] , and similar ideas were discussed by |162j . The starting point 
was the many experimental efforts to detect the 21 cm signal from the epoch of 
reionization when the IGM is close to fully neutral. After reionization approximately 
1% of the baryons are contained in neutral hydrogen (Ohi ~ 0.01). Although this 
reduces the signal significantly relative to a fully neutral IGM, the amplitude of 
Galactic synchrotron emission (which provides the dominant foreground) is several 
orders of magnitude less at the frequencies corresponding to 21 cm emission at redshifts 
z = 1 — 3. Consequently, the signal to noise ratio for radio experiments targeted 
at intensity mapping might be expected to be comparable to experiments targeted 
at the epoch of reionization (EoR). In both cases, the new technologies driven by 
developments in computing power makes possible previous unattempted observations. 

Such observations would be extremely important for our understanding of the 
Universe. It is perhaps humbling to realise that existing observations from the CMB 
and galaxy surveys fill only a small fraction of the potentially observable Universe. 
Figure [9^, shows the available comoving volume out to a given redshift. At present, 
the deepest sky survey over a considerable fraction of the sky is the first Sloan Digital 
Sky Survey (SDSS), whose luminous red galaxies (LRG) sample has a mean redshift 
of z 0.3. By measuring the 21 cm intensity fluctuations at z = 1 — 3 the comoving 
volume probed by experiment would be increased by two orders of magnitude. This is 
particularly important since our ability to constrain cosmological parameters depends 
critically on the survey volume. In general, constraints scale with volume V as 

— 1/2 

o cx V cS since a larger volume means that more independent Fourier modes can 
be constrained. For example, one might improve constraints on quantities such as the 
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Figure 9. The fraction of the total comoving volume of the observable Universe 
that is available up to a redshift z [159] . 



neutrino mass and the running of the primordial power spectrum [163J . 

Perhaps the greatest utility for intensity mapping (IM) surveys is in constraining 
dark energy via measurements of baryon acoustic oscillations (BAOs) in the galaxy 
power spectrum. BAO arise from the same physics that produces the spectacular peak 
structure in the CMB. These oscillations have a characteristic wavelength set by the 
sound horizon at recombination and so provide a "standard ruler" . Measurements of 
this distance scale can be used to probe the geometry of the Universe and to constrain 
its matter content. Measurements of the oscillations in the CMB at z « 1100 and 
more locally in the BAO at z < 3 would provide exquisite tests of the flatness of 
the Universe and of the equation of state of the dark energy. Redshifts in the range 
z«l-3 are of great interest since they cover the regime in which dark energy begins 
to dominate the energy budget of the Universe. While galaxy surveys will begin to 
probe this range in the next decade, covering sufficiently large areas to the required 
depth is very challenging for galaxy surveys. This has been considered in depth by 
the Dark Energy Task Force [164] . 

To get a sense of the observational requirements of BAO observations with 21 cm 
intensity mapping, let us consider some numbers. Beyond the third peak of the BAO 
non-linear evolution begins to wash out the signal. The peak of the third BAO peak 
has a comoving wavelength of 35/i _1 Mpc, which for adequate samples requires pixels 
perhaps half that size. At z = 1.5 the angular scale corresponding to 18/i _1 Mpc is 20 
arcmin, which sets the required angular resolution of the instrument. Estimating the 
HI mass enclosed in such a volume is uncertain, but based on the observed value of 
Q m ~ 10~ 3 is - 2 x 1O 12 M . 

The mean brightness temperature T& of the 21 cm line can be estimated using 

lO" 3 J \ 0.29 J \ 2.5 J ' 1 ' 

where 1 + 6 = p g /p g is the normalised neutral gas density and a = (1 + z)^ 1 is the 
scale factor. The amplitude of the signal is therefore considerably smaller than that 
of the 21 cm signal during reionization. However, the foregrounds are reduced by a 
factor of [(1 + 8)/(l + 1.5)] -2 ' 6 ~ 30, redeeming the situation. Diffuse foregrounds 



T b = 0.3(l + 6) 
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Figure 10. The observable parameter space in redshift and in scale (k) for 
BAO studies. The shaded regions are observationally inaccessible. The horizontal 
lines indicate the scale of the first three BAO wiggles, and the dashed lines show 
contours of constant spherical harmonic order I |161| . 



may be removed in a similar fashion to the 21 cm signal at high redshift. Figure 10 



shows the relevant region of the BAO that could be detected. The top excluded region 
accounts for the non-linear growth of structures, which erases the baryon wiggles in 
the power spectrum. The left exclusion region comes from the finite volume of the 
Universe, which limits the largest wavelength mode that fits within the survey. The 
right region is a rough estimate of the limit from foregrounds based upon a simple 
differencing of neighbouring frequency channels to remove the correlated foreground 
component. The white accessible region demonstrates that the first three BAO peaks 
could be accessible with intensity mapping experiments. 

Observations of the BAO can be used to constrain the dark energy equation of 
state w, which may be parametrised as w = w p + (a p — a)w' , where w p and a p are 
the value of the equation of state at a pivot redshift where the errors in w and w' 
are uncorrelated. Figure [IT] shows the ability of an IM telescope covering a square 
aperture of size 200 m by 200 m subdivided into 16 cylindrical sections each 12.5 
m wide and 200 m long. After integrating for 100 days with bandwidth 3MHz, this 
experiment is roughly comparable to the stage IV milestone set by the Dark Energy 
Task Force (DETF) committee [164 , and provides very good dark energy constraints. 
The power of such an experiment makes it a potential alternative to more traditional 
galaxy surveys |165l 11631 1166j . 

Reaching the thermal noise required by a signal of ^300 /iK at frequencies of 600 
MHz is within the range of existing telescopes. As a first step towards demonstrating 
that such observations are possible, Ref. |167| made use of the Green Bank Telescope 
(GBT) to observe radio spectra corresponding to 21 cm emission at 0.53 < z < 1.12 
over 2 square degrees on the sky. This raw radio data was processed by removing 
a smooth component, dominated by the foregrounds, to leave a map of intensity 
fluctuations. Cross-correlating these intensity fluctuations with a catalog of DEEP2 
galaxies in the same field showed a distinct correlation in agreement with theory. These 
experiments are continuing with the next step to understand instrumental systematics 
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Figure 11. Constraints on the equation of state parameter w = p/pc 2 (horizontal 
axis) and its redshift variation for an equation of state w = w p + (a p — a)w' (vertical 
axis) at z = 0. The 1-rr contour for IM combined with Planck (inner thick solid 
for baseline model, outer thin solid for worst case), the Dark Energy Task Force 
stage I projects with Planck (outer dotted), the stage I and III projects with 
Planck (intermediate dotted), the stage I, III, and IV projects with Planck (inner 
dotted), and all above experiments combined (dashed, again thick for baseline, 
thin for worst case; the two contours are nearly indistinguishable). [161] 



at the level required to cleanly measure the 21 cm intensity auto-power spectrum. 
They constitute an important step towards demonstrating the feasibility of intensity 
mapping with the 21 cm line. 

21 cm intensity mapping probes the neutral hydrogen contained within galaxies 
and one might reasonably worry about how robust a tracer this is of the density field. 
It is believed that damped Lya absorbers (DLAs), which contain the bulk of neutral 
hydrogen, are relatively low-mass systems and so should have relatively low bias (when 
compared to the highly biased bright galaxies seen in high redshift galaxy surveys) 
[1(30] . On top of this, however, one might worry that fluctuations in the ionizing 
background could lead to spatial variation in the neutral hydrogen content. But 
after reionization the mean free path for ionizing photons increases and the ionizing 
background should become fairly uniform with modulation only at the percent level 

5.2. Intensity mapping in other lines 

So far, we have described the possibilities for 21 cm intensity mapping. The 21 cm line 
has a number of nice qualities - it is typically optically thin, it is a line well separated 
in frequency from other atomic lines, and hydrogen is ubiquitous in the Universe. It 
is however only one line out of many that have been observed in local galaxies, which 
begs the question "is 21 cm the best line to use and what additional information 
might be gained by looking at intensity mapping in other lines?" The analysis of the 
impact of intensity mapping using lines other than the 21 cm line has not yet been 
fully explored. Because the physical conditions leading to emission by these species 
are quite varied the cross-species joint analysis of intensity maps is a complex topic 
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that the community has only begun to examine. Because of this variation, the cross- 
analysis is potentially a rich source of information on conditions at high redshift. 

One area where intensity mapping in lines other than 21 cm would be particularly 
interesting is during the epoch of reionization. One of the challenges for understanding 
the first galaxies is the difficulty of placing the galaxies seen in the Hubble Ultradeep 
Field (HUDF) into a proper context. By focusing on a small patch of sky, the HUDF 
sees very faint galaxies, but it is unclear how representative this patch is of the whole 
Universe at that time. For comparison, the full HUDF is approximately 3x3 arcmin 
in size comparable with the size of an individual ionized bubble, expected to be ~ 
ten arcminutes in diameter during the middle stages of reionization. Moreover, it is 
apparent that the galaxies seen in the HUDF are the brightest galaxies and that fainter, 
as yet unseen, galaxies contribute significantly to star formation and reionization. The 
James Webb Space Telescope (JWST) will see even fainter galaxies and transform our 
view of the galaxy population at z ~ 10, but there will still be a substantial level of 
star formation that it will not be able to resolve |168j . 

Combining these galaxy surveys with 21 cm observations and intensity mapping 
would allow a powerful synergy between three independent types of observations 
directed at understanding the first galaxies and the epoch of reionization (illustrated 
in Figure 12). Deep galaxy surveys with HST and JWST would inform us of the 
detailed properties of small numbers of galaxies during the EoR. 21 cm tomography 
provides information about the neutral hydrogen gas surrounding groups of galaxies. 
Intensity mapping fills in the gaps providing information about the total emission and 
clustering of the full population of galaxies, even those below the sensitivity threshold 
of the JWST. Together these three techniques would provide a highly complete view 
of galaxies at high redshift and transform our understanding of the origins of galaxy 
formation. 




Figure 12. Cartoon of the role intensity mapping would play in understanding 
galaxy formation. Deep galaxy surveys with HST and JWST image the properties 
of individual galaxies in small fields (blue boxes). 21 cm tomography (red filled 
region) provides a "negative space" view of the Universe by determining the 
properties of the neutral gas surrounding groups of galaxies. Intensity mapping 
(purple filled regions) fills in the gaps providing information about the collective 
properties of groups of galaxies. Together the three would give a complete view 
of the early generation of galaxies in the infant universe. 



CONTENTS 



46 



The first steps towards understanding intensity mapping in molecular lines 
were made by Righi, Hernandez-Monteagudo & Sunyaev [169] . who considered the 
possibility that redshifted emission from CO rotational lines might contribute a 
foreground to CMB experiments. They showed that cross-correlating CMB maps 
of differing spectral resolution at frequencies of v w 30 GHz could be used to constrain 
the CO emission of galaxies at z ~ 3 — 8. This might be done, for example, by 
correlating CMB maps from CBI with those of the Planck LFI. These ideas were 
focussed on the notion of 2D maps, but contain the seed of later work. 
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Figure 13. Ratio between line luminosity, L, and star formation rate, M*, for 
various lines observed in galaxies and taken from Table 1 of Ref. |170l . For the 
first 7 lines this ratio is measured from a sample of low redshift galaxies. The 
other lines have been calibrated based on the galaxy M82. Some weaker lines, for 
example for HCN, have been omitted for clarity. 



Ref. [171] explored for the first time the notion of looking at a variety of lines 
in 3D intensity maps. Figure fl3] shows the major lines that appear in the emission 
spectrum of M82. By making a map at the appropriate frequency any of these lines 
could be studied by intensity mapping. Two major challenges arise. The first is that 
continuum foregrounds are typically larger than the signal from these lines by 2-3 
orders of magnitude. This is an identical, although more tractable problem, as occurs 
in the case of 21 cm studies of the EoR and has been studied in considerable detail. 
Studies |142l 11721 1173] show that, provided the continuum foreground is spectrally 
smooth in individual sky pixels, it can be removed leaving very little residuals in the 
cleaned signal. 

Potentially more challenging is the issue of line confusion. If we look for the CO(T 
0) line (rest frame frequency of 115 GHz) in a map made at 23 GHz (corresponding to 
emission by CO at z — 4) then our map will additionally consider emission from other 
lines in galaxies at other redshifts (e.g. CO(2-l) from galaxies at z = 9). However, 
the contaminating emission arises from different galaxies which opens the possibility 
of combining maps at different frequencies corresponding to different lines from the 
same galaxies as a way of isolating a particular redshift. The emission from lines in the 
same galaxies will correlate, while emission from lines in galaxies at different redshifts 
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Figure 14. A slice from a simulated realization of line emission from galaxies 
at an observed wavelength of 441^tm (left) and 364/^m (right) 170 . The slice 
is in the plane of the sky and spans 250 X 250 comoving Mpc 2 with a depth 
of Au/u = 0.001. The colored squares indicate pixels which have line emission 
greater than 200Jy/Sr for the left panel and 250Jy/Sr for the right panel. The 
emission from OI(63/^m) and OIII(52^tm) is shown in red on the left and right 
panels, respectively, originating from the same galaxies at z = 6. All of the lines 
illustrated in Figure [T3] are included and plotted in blue. Cross correlating data 
at these two observed wavelengths would reveal the emission in OI and OIII from 
2 = 6 with the other emission lines being essentially uncorrelated. 



will not[|] 

Fortunately, it is possible to statistically isolate the fluctuations from a particular 
redshift by cross correlating the emission in two different lines |171[ If 70j . If one 
compares the fluctuations at two different wavelengths, which correspond to the same 
redshift for two different emission lines, the fluctuations will be strongly correlated. 
However, the signal from any other lines arises from galaxies at different redshifts which 
are very far apart and thus will have much weaker correlation (see Figure 14 ) . In this 
way, one can measure either the two-point correlation function or power spectrum of 
galaxies at some target redshift weighted by the total emission in the spectral lines 
being cross correlated. 

The cross power spectrum at a wavenumber k can be written as, 

Pi,20) = S!S 2 PP(k) + Pshot, (61) 

where S\ and S2 are the average fluxes in lines 1 and 2 respectively, b is the average 
bias factor of the galactic sources, P(k) is the matter power spectrum, and P s hot is 
the shot-noise power spectrum due to the discrete nature of galaxies. The root-mean- 
square error in measuring the cross power spectrum at a particular fc-mode is given 

by nm, 

8P?,2 = \i P h + ^ltotalfWal), (62) 



f This is similar to the suggestion of using the 21 cm map as a template to detect the deuterium 
hyperfine line |12j . 
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Figure 15. The cross power spectrum of OI(63 fim) and OIII(52 /im) at 
2 = 6 measured from mock simulation data for a hypothetical infrared space 
telescope similar to SPICA |170] . The solid line is the cross power spectrum 
measured when only line emission from galaxies in the target lines is included. 
The points are the recovered power spectrum when detector noise, contaminating 
line emission, galaxy continuum emission, and dust in our galaxy and the CMB 
are included. The error bars follow Eq. ( |02| with Pitotal an d f^total calculated 
from the simulated data, including detector noise, contaminating line emission 
and sample variance. 



where Pitotai and ^2totai are the total power spectra corresponding to the first line 
and second line being cross correlated. Each of these includes a term for the power 
spectrum of contaminating lines, the target line, and detector noise. Figure [l5| shows 
the expected errors in the determination of the cross power spectrum using the 01(63 
/zm) and 0111(52 fim) lines at a redshift z — 6 for an optimized spectrometer on a 3.5 
meter space-borne infrared telescope similar to SPICA, providing background limited 
sensitivity for 100 diffraction limited beams covering a square on the sky which is 1.7° 
across (corresponding to 250 comoving Mpc) and a redshift range of Az = 0.6 (280 
Mpc) with a spectral resolution of (Av/v) = 10~ 3 and a total integration time of 
2 x 10 6 seconds. 

We emphasize that one can measure the line cross power spectrum from galaxies 
which are too faint to be seen individually over detector noise. Hence, a measurement 
of the line cross power spectrum can provide information about the total line emission 
from all of the galaxies which are too faint to be directly detected. One possible 
application of this technique would be to measure the evolution of line emission over 
cosmic time to better understand galaxy evolution and the sources that reionized the 
Universe. Changes in the minimum mass of galaxies due to photoionization heating of 
the intergalactic medium during reionization could also potentially be measured |171) . 
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Figure 16. The global mean CO brightness temperature as a function of 
redshift I175| . The curves show the volume-averaged CO brightness temperature 
for several different values of M CO:m j n , the minimum halo mass hosting a CO 
luminous galaxy. In all cases, the minimum host halo mass of star forming 
galaxies is the atomic cooling mass, M s { m i„ ~ 10 s Mq, and so the curves with 
larger M co m i n describe models in which galaxies with low star formation rates 
(SFRs) are not CO luminous. If we had instead varied M s f m i n , while fixing both 
A^co.min = A^af.min an d the SFR density at a given redshift, the model variations 
would be considerably smaller. 



As a definite example of the sort of information that might be gathered in 
molecular line intensity mapping, we will discuss CO, the only line other than the 
21 cm line to have been somewhat investigated [174111751 1176] . CO emission is widely 
used as a tracer for the mass of cold molecular gas in a galaxy. This cold molecular 
gas provides the fuel for star formation and so determining its abundance provides a 
key constraint on models of star formation and galaxy evolution. Indeed one of the 
key science goals for ALMA is to directly image CO emission in individual galaxies at 
z rj 7. Figure 16 shows the redshift evolution of the mean brightness temperature (a 
measure of the radio intensity) for the CO(2-l) line as a function of the minimum mass 
required for a galaxy to be CO bright. These calculations identify a signal strength of 
0.01 — 1 /LtK at z = 8 as the observational target. This represents an order of magnitude 
increase in sensitivity over existing CMB experiments like CBI and might be enabled 
by improved techniques with dishes exploiting focal plane arrays. 

Figure 17 shows the evolution of the auto-power spectrum of the CO intensity 
fluctuations. This signal evolves strongly with redshift due to its dependence on (Too) 
and has a shape that depends on the relative contribution of the clustering of CO bright 
galaxies and a Poisson shot noise term, which is determined by the distribution of CO 
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Figure 17. The auto power spectrum of CO brightness temperature fluctuations 
|175| . The black solid, red dotted, red dashed, blue dashed, and cyan dot-dashed 
curves show simulated CO power spectra at different redshifts for various values 
of M co . m in and fixed duty cycle. The green solid line is the halo model prediction 
for the signal for z = 7.3, M co m i n = 1O 1O M0, and /duty = 0.1. The green 
dashed line shows the Poisson term, while the green dot-dashed curve represents 
the clustering term. 



galaxy masses. Measurement of the shape and evolution of this power spectrum would 
therefore give considerable information about the properties of early CO galaxies. 

5.3. Cross-correlation of molecular and 21 cm intensity maps 

Earlier, we described 21 cm tomography observations with instruments like MWA, 
LOFAR, and PAPER that hope to map the 21 cm emission from neutral hydrogen 
during the EoR. These maps trace out the ionization field, which is determined by 
the distribution of galaxies. Regions with many galaxies will tend to have ionized 
the surrounding IGM leading to a "hole" in the 21 cm signal. In contrast, since 
those same galaxies will have undergone considerable star formation and so produced 
metals, regions with galaxies will appear CO bright. This is clearly seen in the bottom 
two panels of Figure [18] - blue CO bright regions correspond to black patches with 
no signal in the 21 cm map. This anti-correlation can be detected statistically and 
provides a very direct constraint on the size of the ionized regions. Maps made in 
lines from different atoms or molecules might be used in a related way. Each atomic 
species will reflect different properties of the host galaxies and combining maps may 
enable a detailed understanding of the host metallicity and internal properties to be 




Figure 18. Redshift slices at z ss 7 taken from the simulation of Ref. |175l and 
illustrating the galaxy distribution (top right), ionization field (top left), 21 cm 
fluctuations (bottom left), and CO intensity fluctuations (bottom right). These 
last two fields have been smoothed to an angular resolution of 6 arcmin. The 
CO bright galaxies seen in the CO intensity map are also responsible for ionizing 
the neutral gas. This can be seen by the correlation between bright CO regions 
and dark (ionized) patches in the 21 cm ma p. The associated signal provides a 
quantitative version of the cartoon in Figure |12| I175| 

The model used for these calculations was based upon an empirical calibration 
of the CO luminosity function to observations of galaxies at z < 3 [175] . This was 
then extrapolated to higher redshifts as a best guess at how CO galaxies might evolve. 
More satisfying would be to connect the CO luminosity to the intrinsic properties 
of the galaxy interstellar medium (ISM) . Such modeling of the detailed properties of 
CO emission from galaxies has been attempted |177j . CO emission in local galaxies 
is known to originate in giant molecular clouds (GMCs) whose typical size is ~10 
pc. These clouds are observed to be optically thick to CO emission and so the 
empirically determined scaline of the CO luminosity with molecular hydrogen mass, 
Lqo oc Mh 2 , is best explained by the non-overlap of these clouds in angle and velocity. 
The relationship then results from the CO luminosity being proportinal to the number 
of clouds. In addition, the detailed physics of the heating of these clouds by stars and 
active galactic nuclei (AGN), which determines the excitation state of the different 
CO rotational levels, must be accounted for. There is increased interest in numerical 
simulation of the chemistry and properties of GMCs |178j . which can inform semi- 
analytic modelling. Connecting the global properties of CO emission to such ISM 
details is challenging, but would be greatly rewarding in providing a new way of 
learning about the ISM of galaxies at high redshift. Importantly the chemistry that 
determines the CO abundance may change dramatically in the denser metal poor ISM 
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of early galaxies |179j . 

Clearly, there is interesting information to be harvested from measurements of 
intensity fluctuations in line emission from galaxies at high redshifts. In the above 
discussion we have focussed on preliminary results from the study of CO, but similar 
results apply to the array of other atomic and molecular lines. An important question 
is how the signal might best be observed and which lines are most important. Some 
lines, like CO, can be targeted from the ground but others, such as 01(63 (an), lie in 
the infra-red and must be observed from space. The detailed design of the optimal 
instrument and the requirements for sensitivity and angular resolution are not yet 
well understood. Furthermore, the statistical tools for removing continuum and line 
contamination efficiently need to be developed. 

A future space borne infrared telescope such as SPICA could be ideal for intensity 



mapping. As illustrated in Fig. 15 |170j . an optimized instrument on SPICA could 
measure the 01(63 fim) and 0111(52 /zm) line signals very accurately out to high 
redshifts. Such an instrument would require the capability to take many medium 
resolution spectra at adjacent locations on the sky. However, it may be possible to 
sacrifice angular resolution since the IM technique does not involve resolving individual 
sources. 



6. 21 cm forest 

While most interest in the redshifted 21 cm line has focused on the case where the CMB 
forms a backlight, an important alternative is the case where the 21 cm absorption 
is imprinted on the light from radio loud quasars. Historically, the first attempts 
to detect the extra-galactic 21 cm line were of this nature [180] . The resulting one 
dimensional absorption spectra are expected to show a forest of lines originating from 
absorption in clumps of neutral hydrogen along the line of sight and have been called 
the 21 cm forest by analogy to the Lya forest. 

Whereas the Lya forest is primarily visible at redshifts z < 6, when the Universe 
is mostly ionized, the 21 cm forest is strongest at higher redshifts where there is 
a considerable neutral fraction. Additionally, since the 21 cm line is optically thin 
and does not saturate, the 21 cm forest can contain detailed information about 
the properties of the IGM as well as collapsed gaseous halos. There is a clear 
complementarity to the two forests for probing the evolution of the IGM over a wide 
range of redshifts. 

The 21 cm forest differs from the emission signal of the diffuse medium in a 
number of important ways. First, because the brightness temperature of radio loud 
quasars is typically ~ 10 11_12 K the 21 cm line is always seen in absorption against 
the source. The strength of the absorption feature depends upon the 21 cm optical 
depth 

, , _ 3 h p c 3 A 10 xmnn(z) / r „n. 
T " o[Z) ~ 32ir k B vl r s (l + *)(d»||/dr||)' ( ' 

w 0.009(1 + 5)(l + zf' 2 ^. 

Ts 

We see from this expression that the decrement depends primarily upon the neutral 
fraction and spin temperature. In contrast to the case where the CMB is the backlight, 
there is no saturation regime at large values of Tg. The decrement is maximised for 
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a fully neutral and cold IGM - heating or ionizing the gas will reduce the observable 
signature. 

This signature may show several distinct features: (i) A mean intensity decrement 
blueward of the 21 cm restframe frequency whose depth depends on the mean IGM 
optical depth at that redshift; (ii) small-scale variations in the intensity due to 
fluctuations in the density, neutral fraction and temperature of the IGM at different 
points along the line of sight; (in) transmission windows due to photoionized bubbles 
along the line of sight; and (iv) deep absorption features arising from the dense neutral 
hydrogen clouds in dwarf galaxies and minihalos [1811 1182] , Figure 19 shows an 
example of a 21 cm forest spectrum in which a number of these features can be 
seen. 




Figure 19. Simulated spectrum from 128 to 131 MHz of a source with brightness 
5(120 MHz) = 20 mjy at z = 10 using a model spectrum based on that of Cygnus 
A and assuming HI 21 cm absorption by the IGM. Thermal noise has been added 
using the specifications of the square kilometer array (SKA) and assuming 10 
days integration with 1 kHz wide spectral channels. The solid line is the model 
spectrum without noise or absorption I181| . 



Since the evolution of the optical depth depends on the mean neutral fraction 
and the spin temperature, we can understand the evolution of the 21 cm forest based 
on Figure |4j At early times, the IGM is fully neutral and the evolution is dictated by 
the spin temperature. Tg tracks the gas kinetic temperature and rises from tens of 
K to thousands of K by the end of reionization, causing r VQ to fall by several orders 
of magnitude due to heating alone. Then, as reionization takes place and the neutral 
fraction drops from xh — 1 to the xh ~ 10 seen in the Lya forest, the optical depth 
drops even further. Tracing the evolution of the mean optical depth would provide 
a useful constraint on the thermal evolution of the IGM and give a clear indication 
of the end of reionization. Figure 20 shows the evolution of r„ in a model where 
Tg = Ttf showing the very significant evolution in the optical depth with redshift. 

Detecting the mean decrement in the 21 cm forest may be challenging since it 
is relatively weak and requires detailed fitting of the unabsorbed continuum level. A 
potentially more robust method is to exploit the statistics of individual features. As 
reionization occurs, the appearance of ionized bubbles will show up in the forest as 
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Figure 20. Mean 21cm optical depth as a function of redshift, for varying fx 
values: from top to bottom, fx = (no x-ray heating), 0.01, 0.1, 1 (thick black 
line), 10, and 100. Also included are lines of S m i n (dashed purple lines) as denned 
in \65\ . These lines indicate, for each value of r on the left-hand axis, the minimum 
observed flux density of a source that would allow a detection at signal-to-noise 
of 5 of the flux decrement due to absorption, assuming an array with effective 
area A e ff = 10 6 m 2 , frequency resolution Au c ^ = 1 kHz, system temperature 
T aya = 100K(^/200MHz)- 2 ' 8 , and integration time t int = 1 week. When the 
mean optical depth lines (solid) cross above the S m j n lines (dashed), the flux 
decrement is detectable at the corresponding redshift 11831 . 



an increasing number of windows of near total transparency. If these lines could be 
resolved, the distribution of equivalent widths would give a measure of the process of 
reionization [THU EEM QH] ■ 

It is possible that the forest does not have to be fully resolved and could be 
analysed statistically instead. For example, the appearance of ionized regions will 
lead to a change in the variance of the signal in different frequency bins. This has 
been shown to be an effective discriminant of the end of reionization |183j and has the 
advantage of not requiring narrow features to be resolved in frequency. 

The signal from the diffuse IGM leads to a relatively low optical depth, as seen in 



( 63 1 . The signal from minihalos can be considerably stronger |1821 11181 1185] . These 
are structures that have collapsed and virialised, but because of their low mass to 
not reach the temperature of 10 4 K required for atomic hydrogen cooling. Their high 
density contrast and relatively low temperature can lead to optical depths as high as 
t ~ 0.1 within a frequency width of Av ~ 2 kHz |186j . The mean overdensity required 
to give an optical depth r is given by 



2 / ,~ \ 3 



The number density of minihalos is sensitive to the thermal properties of the IGM, 
which if heated may raise the Jeans mass sufficiently to prevent the collapse of the 
least massive minihalos that dominate the signal in the forest. Detailed descriptions 
of line profiles and their statistics have been considered based upon analytic models 
of the density distribution around halos [1821 11871 11851 1188] and in high resolution 
simulations [121] . Even a single line of sight showing such structures could be used to 
provide information on early halo formation. 
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Unfortunately, the sensitivity required to detect the 21 cm forest is challenging. 
We can consider the thermal noise required to directly detect the flux decrement 
arising from the mean absorption. Using the relationship S v — 2ksT a /A e to relate 
the antennae temperature to the flux sensitivity for a single polarisation and the 
radiometer equation gives the minimum source brightness for the decrement to be 
visible 



where we have assumed that both polarisation states are measured. The value of 
A c tf/T sys used is appropriate for SKA giving a sense of the challenge in detecting 
the forest. The system temperature is typically set by the sky temperature and is 
determined by galactic foregrounds. 

The major uncertainty in the utility of the 21 cm forest is the existence of radio 
loud sources at high redshifts. The most promising set of sources are radio loud 
quasars. We are currently ignorant of the number density of radio-loud quasars at 
high redshifts. Quasars have currently been observed to redshifts as high as z = 7.1 
|189j . but their number density drops rapidly at redshifts z > 3 [190] . Furthermore, 
the radio loud fraction of quasars is poorly understood and may evolve at higher 
redshifts [191] . Models calibrated to the number of counts at low redshifts lead to 
predictions of ~ 2000 sources across the sky with S > 6 Jy at 8 < z < 12 |192) . Many 
of these sources would show up in existing radio surveys, but it is currently impossible 
to distinguish these high-z radio bright quasars from low-z radio galaxies. Future large 
area surveys with NIR photometry, such as EUCLID or WFIRST could enable the 
identification of these high-z quasars providing targets for the SKA [193J. Gamma ray 
bursts (GRBs) and hypernovae may provide alternative high-z sources |194l 11951 1196] . 
but are believed to have a lower maximum brightness at the low frequencies of interest 
because of synchrotron self-absorption. GRBs do however have the virtue of already 
having been observed in the desired redshift range. 

This rapid decline in radio-bright quasars is unfortunate since the signal is most 
strong early on before ionization and heating have begun significantly. In any case, it 
seems likely that only a few sight lines to radio bright objects will become available 
for 21 cm forest observations. Since these observations are affected by totally different 
systematics than 21 cm tomography, they could be very important for identifying the 
properties of the IGM in the early Universe. 

7. Conclusions and outlook 

Cosmology in the 21st century is engaged in the process of pushing the boundaries 
of sensitivity and completeness of our understanding of the Universe. Observations 
of the redshifted 21 cm line offer a new window into the properties of the Universe 
at redshifts z — 1 — 150 filling in a crucial gap in observations of the period where 
the first structures and stars had formed. The 21 cm signal depends upon physics on 
different scales from the atomic physics of hydrogen, through the astrophysics of star 
formation, to the physics of cosmological structures. Through this dependence the 
21 cm signal has enormous potential to improve our understanding of the Universe. 
Over the next decade, radio experiments will begin the process of exploiting 21 cm 
observations and will reveal how much of this potential can be realised. 
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In this review, we have explored four key ways in which 21 cm observations might 
be exploited to learn about cosmology and astrophysics. Low angular resolution 
observations with small numbers of radio dipoles can measure the sky averaged 
"global" 21 cm signal providing information about key milestones in the history of 
the Universe. Understanding these moments when the first radiation backgrounds 
illuminated intergalactic space will place constraints on the properties of the first 
galaxies and limit the contribution of more exotic physics. Experiments involving 
just a handful of people, such as EDGES and CoRE, have taken the first steps along 
this path. There is currently a growing effort to using different radio dipole designs 
and attempting to combine the benefits of interferometers with individual absolutely 
calibrated dipoles. There is a significant challenge in separating a relatively smooth 21 
cm signal from smooth galactic foregrounds and much hard work ahead before these 
experiments yield robust results. 

More ambitiously, collections of thousands of dipoles, digitally combined into 
radio interferometers will have the raw sensitivity to measure fluctuations in the 21 
cm signal, providing detailed information about how the Universe was heated and 
ionized. The first generation of such instruments - LOFAR, MWA, PAPER, and 
21CMA - have or soon will be completed and data is starting to become available. 
These first instruments will peer through a hazy window of low sensitivity, but should 
determine when the epoch of reionization took place. Looking further ahead, the SKA 
will see this period clearly mapping the ionized structures and providing information 
about heating and coupling of the IGM during the formation of the first galaxies. The 
combination of 21 cm observations with observations of the CMB and with surveys of 
high redshift galaxies and information from the Lyman alpha forest should enable the 
reionization history and details of reionization to be inferred |197l 1198] . 

21 cm observations also have the potential to constrain fundamental cosmology 
and represent a path to a new level of precision cosmology. Exploiting 21 cm 
observations for fundamental cosmology will be a challenging task and it is unclear 
how much can realistically be hoped for. During the EoR astrophysics mixes with 
cosmology in the 21 cm fluctuations and until the performance of instruments is better 
understood it is not clear whether these two can be separated. In the further future, 
21 cm observations of the cosmic dark ages provide a long term hope that most of the 
volume of the Universe could one day be mapped and used for cosmology. 

Applying similar techniques at lower redshifts allows for intensity mapping of the 
neutral hydrogen in galaxies, providing an alternative to traditional redshift surveys 
of galaxies in the optical-infrared bands. In contrast to the traditional approach, only 
the cumulative line emission from many galaxies over large volumes is being mapped, 
and galaxies are not being resolved individually. This intensity mapping technique 
could help shed light on the properties of dark energy through the use of baryon 
acoustic oscillations as a standard ruler to measure the expansion of the Universe. 
Further, intensity mapping in other molecular and atomic lines can complement 21 
cm tomography allowing a complete picture of the formation of stars and metals to 
be developed. 

Finally, observations of individual radio bright sources, such as quasars, might 
allow observations of the 21 cm forest. This provides another window into the 
properties of the IGM at the end of reionization that depends on very different 
systematics than 21 cm tomography. The 21 cm forest would allow the small scale 
properties of the IGM to be studied in great detail and so constrain the properties of 
dark matter. This is a powerful technique and the main uncertainty is the abundance 
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of target sources. Large area NIR surveys should begin to answer this question soon, 
as more high redshift quasars are found and followed up to determine the radio loud 
fraction. Just a handful of sufficiently bright radio sources at z > 7 would make this 
a very valuable probe of reionization and the IGM. 

21 cm cosmology is a new field with much yet to be discovered, but it is one 
with great potential for the future. Much of this potential stems from breakthroughs 
in computing power which make "digital radio astronomy" feasible and allow for 
extremely large radio interferometers. It is to be hoped that in the decades to come 21 
cm observations will transform our understanding of the cosmic dawn and the epoch 
of reionization pushing farther our detailed understanding of the cosmos. 
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